Book of Proof 
Third Edition 


Richard Hammack 


Published by Richard Hammack 
Richmond, Virginia 


Book of Proof 
Edition 3.3 
© 2018 by Richard Hammack 


This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivative 
4.0 International License 




Typeset in 11pt TeX Gyre Schola using PDFLaTeX


Cover by R. Hammack. The cover diagrams are based on a geometric construction that 
renders a correct perspective view of an object (here an octagonal column) from its floor 
plan. The method was invented by Piero della Francesca (1415-1492), a Renaissance painter
and mathematician. 


To my students 


Contents 


Preface 


Introduction 


I Fundamentals

1. Sets
1.1. Introduction to Sets
1.2. The Cartesian Product
1.3. Subsets
1.4. Power Sets
1.5. Union, Intersection, Difference
1.6. Complement
1.7. Venn Diagrams
1.8. Indexed Sets
1.9. Sets That Are Number Systems
1.10. Russell’s Paradox

2. Logic
2.1. Statements
2.2. And, Or, Not
2.3. Conditional Statements
2.4. Biconditional Statements
2.5. Truth Tables for Statements
2.6. Logical Equivalence
2.7. Quantifiers
2.8. More on Conditional Statements
2.9. Translating English to Symbolic Logic
2.10. Negating Statements
2.11. Logical Inference
2.12. An Important Note

3. Counting
3.1. Lists
3.2. The Multiplication Principle
3.3. The Addition and Subtraction Principles
3.4. Factorials and Permutations
3.5. Counting Subsets
3.6. Pascal’s Triangle and the Binomial Theorem
3.7. The Inclusion-Exclusion Principle
3.8. Counting Multisets
3.9. The Division and Pigeonhole Principles
3.10. Combinatorial Proof


II How to Prove Conditional Statements

4. Direct Proof
4.1. Theorems
4.2. Definitions
4.3. Direct Proof
4.4. Using Cases
4.5. Treating Similar Cases

5. Contrapositive Proof
5.1. Contrapositive Proof
5.2. Congruence of Integers
5.3. Mathematical Writing

6. Proof by Contradiction
6.1. Proving Statements with Contradiction
6.2. Proving Conditional Statements by Contradiction
6.3. Combining Techniques
6.4. Some Words of Advice

III More on Proof


7. Proving Non-Conditional Statements
7.1. If-and-Only-If Proof
7.2. Equivalent Statements
7.3. Existence Proofs; Existence and Uniqueness Proofs
7.4. Constructive Versus Non-Constructive Proofs

8. Proofs Involving Sets
8.1. How to Prove a ∈ A
8.2. How to Prove A ⊆ B
8.3. How to Prove A = B
8.4. Examples: Perfect Numbers

9. Disproof
9.1. Counterexamples
9.2. Disproving Existence Statements
9.3. Disproof by Contradiction

10. Mathematical Induction
10.1. Proof by Induction
10.2. Proof by Strong Induction
10.3. Proof by Smallest Counterexample
10.4. The Fundamental Theorem of Arithmetic
10.5. Fibonacci Numbers




IV Relations, Functions and Cardinality

11. Relations
11.1. Relations
11.2. Properties of Relations
11.3. Equivalence Relations
11.4. Equivalence Classes and Partitions
11.5. The Integers Modulo n
11.6. Relations Between Sets

12. Functions
12.1. Functions
12.2. Injective and Surjective Functions
12.3. The Pigeonhole Principle Revisited
12.4. Composition
12.5. Inverse Functions
12.6. Image and Preimage

13. Proofs in Calculus
13.1. The Triangle Inequality
13.2. Definition of a Limit
13.3. Limits That Do Not Exist
13.4. Limit Laws
13.5. Continuity and Derivatives
13.6. Limits at Infinity
13.7. Sequences
13.8. Series

14. Cardinality of Sets
14.1. Sets with Equal Cardinalities
14.2. Countable and Uncountable Sets
14.3. Comparing Cardinalities
14.4. The Cantor-Bernstein-Schröder Theorem

Conclusion

Solutions


Preface to the Third Edition 


My goal in writing this book has been to create a very inexpensive
high-quality textbook. The book can be downloaded from my web
page in PDF format for free, and the print version costs considerably less
than comparable traditional textbooks.

In this third edition, Chapter 3 (on counting) has been expanded, and a 
new chapter on calculus proofs has been added. New examples and exercises 
have been added throughout. My decisions regarding revisions have been 
guided by both the Amazon reviews and emails from readers, and I am 
grateful for all comments. 

I have taken pains to ensure that the third edition is compatible with the 
second. Exercises have not been reordered, although some have been edited 
for clarity and some new ones have been appended. (The one exception 
is that Chapter 3’s reorganization shifted some exercises.) The chapter 
sequencing is identical between editions, with one exception: The final 
chapter on cardinality has become Chapter 14 in order to make way for the 
new Chapter 13 on calculus proofs. There has been a slight renumbering of 
the sections within chapters 10 and 11, but the numbering of the exercises 
within the sections is unchanged. 

The core of this book is an expansion and refinement of lecture notes I
developed while teaching proofs courses over the past 18 years at Virginia 
Commonwealth University (a large state university) and Randolph-Macon 
College (a small liberal arts college). I found the needs of these two audiences 
to be nearly identical, and I wrote this book for them. But I am mindful of a 
larger audience. I believe this book is suitable for almost any undergraduate 
mathematics program. 


Richard Hammack, Lawrenceville, Virginia
February 14, 2018 


Introduction 


This is a book about how to prove theorems.


Until this point in your education, mathematics has probably been 
presented as a primarily computational discipline. You have learned to 
solve equations, compute derivatives and integrals, multiply matrices and 
find determinants; and you have seen how these things can answer practical 
questions about the real world. In this setting your primary goal in using 
mathematics has been to compute answers. 

But there is another side of mathematics that is more theoretical than 
computational. Here the primary goal is to understand mathematical 
structures, to prove mathematical statements, and even to invent or discover 
new mathematical theorems and theories. The mathematical techniques 
and procedures that you have learned and used up until now are founded 
on this theoretical side of mathematics. For example, in computing the area 
under a curve, you use the fundamental theorem of calculus. It is because 
this theorem is true that your answer is correct. However, in learning 
calculus you were probably far more concerned with how that theorem could 
be applied than in understanding why it is true. But how do we know it is 
true? How can we convince ourselves or others of its validity? Questions of 
this nature belong to the theoretical realm of mathematics. This book is an 
introduction to that realm. 

This book will initiate you into an esoteric world. You will learn and 
apply the methods of thought that mathematicians use to verify theorems, 
explore mathematical truth and create new mathematical theories. This 
will prepare you for advanced mathematics courses, for you will be better 
able to understand proofs, write your own proofs and think critically and 
inquisitively about mathematics. 

The book is organized into four parts, as outlined below. 




PART I Fundamentals

• Chapter 1: Sets
• Chapter 2: Logic
• Chapter 3: Counting

Chapters 1 and 2 lay out the language and conventions used in all advanced 
mathematics. Sets are fundamental because every mathematical structure, 
object, or entity can be described as a set. Logic is fundamental because it 
allows us to understand the meanings of statements, to deduce facts about 
mathematical structures and to uncover further structures. All subsequent 
chapters build on these first two chapters. Chapter 3 is included partly 
because its topics are central to many branches of mathematics, but also 
because it is a source of many examples and exercises that occur throughout 
the book. (However, the course instructor may choose to omit Chapter 3.) 


PART II Proving Conditional Statements

• Chapter 4: Direct Proof
• Chapter 5: Contrapositive Proof
• Chapter 6: Proof by Contradiction


Chapters 4 through 6 are concerned with three main techniques used for 
proving theorems that have the “conditional” form “If P, then Q.” 


PART III More on Proof

• Chapter 7: Proving Non-Conditional Statements
• Chapter 8: Proofs Involving Sets
• Chapter 9: Disproof
• Chapter 10: Mathematical Induction


These chapters deal with useful variations, embellishments and consequences
of the proof techniques introduced in Chapters 4 through 6.


PART IV Relations, Functions and Cardinality

• Chapter 11: Relations
• Chapter 12: Functions
• Chapter 13: Proofs in Calculus
• Chapter 14: Cardinality of Sets


These final chapters are mainly concerned with the idea of functions, which 
are central to all of mathematics. Upon mastering this material you will be 
ready for advanced mathematics courses such as abstract algebra, analysis, 
topology, combinatorics and theory of computation. 


Free PDF version




The chapters are organized as in the following dependency tree. The 
left-hand column forms the core of the book; each chapter in this column 
uses material from all chapters above it. Chapters 3 and 13 may be omitted 
without loss of continuity. But the material in Chapter 3 is a great source 
of exercises, and the reader who omits it should ignore the later exercises 
that draw from it. Chapter 10, on induction, can also be omitted with no 
break in continuity. However, induction is a topic that most proof courses 
will include. 


Dependency Tree

[Figure: dependency tree. The main column lists, from top to bottom:]

Chapter 1: Sets
Chapter 2: Logic
Chapter 4: Direct Proof
Chapter 5: Contrapositive Proof
Chapter 6: Proof by Contradiction
Chapter 7: Non-Conditional Proof
Chapter 8: Proofs Involving Sets
Chapter 9: Disproof
Chapter 10: Mathematical Induction
Chapter 11: Relations
Chapter 12: Functions
Chapter 13: Proofs in Calculus
Chapter 14: Cardinality of Sets

Chapter 3 (Counting) is drawn beside the main column, next to Chapter 2.
§3.5 and §3.6 are used in some exercises. Ignore them if Chapter 3 is omitted.


Richard Hammack Book of Proof 




To the instructor. The book is designed for a three or four credit course.
A course emphasizing discrete mathematics could cover Chapters 1-12. A
course that is more of a preparation for analysis could cover all but Chapter 3.
The following timetable (for a fourteen-week semester) is a hybrid of these
two options. Sections marked with * may require only the briefest mention
in class, or may be best left for the students to digest on their own.


Week | Monday                       | Wednesday               | Friday
1    | Section 1.1                  | Section 1.2             | Sections 1.3, 1.4
2    | Sections 1.5, 1.6, 1.7       | Section 1.8             | Sections 1.9*, 2.1
3    | Section 2.2                  | Sections 2.3, 2.4       | Sections 2.5, 2.6
4    | Section 2.7                  | Sections 2.8*, 2.9      | Sections 2.10, 2.11*, 2.12*
5    | Sections 3.1, 3.2, 3.3       | Sections 3.4, 3.5       | Sections 3.5, 3.6
6    | EXAM                         | Sections 4.1, 4.2, 4.3  | Sections 4.3, 4.4, 4.5*
7    | Sections 5.1, 5.2, 5.3*      | Section 6.1             | Sections 6.2, 6.3*
8    | Sections 7.1, 7.2*, 7.3, 7.4 | Sections 8.1, 8.2       | Section 8.3
9    | Section 8.4                  | Sections 9.1, 9.2, 9.3* | Section 10.1
10   | Sections 10.1, 10.4*         | Sections 10.2, 10.3     | EXAM
11   | Sections 11.1, 11.2          | Sections 11.3, 11.4     | Sections 11.5, 11.6
12   | Section 12.1                 | Section 12.2            | Section 12.2
13   | Sections 12.3, 12.4*         | Section 12.5            | Sections 12.5, 12.6*
14   | Section 14.1                 | Section 14.2            | Sections 14.3, 14.4*


The entire book could be covered in a 4-credit course, or in a 3-credit 
course pitched to a more mature audience. 


Acknowledgments. I thank my students in VCU’s MATH 300 courses for 
offering feedback as they read the first edition of this book. Thanks especially 
to Cory Colbert and Lauren Pace for rooting out typographical and logical 
mistakes. Cory proofed early drafts of each chapter before I posted them 
to my web page, created the index, suggested some interesting exercises 
and wrote some solutions. Thanks to Moa Apagodu, Sean Cox, Brent Cody 
and Andy Lewis for suggesting improvements while teaching from the book, 
and to John Ganci for proofing the entire third edition. Mehmet Dagli 
caught further typos and inconsistencies while painstakingly translating 
the book into Turkish. I am indebted to Lon Mitchell, whose expertise with 
typesetting and on-demand publishing made the print version of this book 
a reality. 

And thanks to countless readers all over the world who contacted me 
concerning errors and omissions. Because of you, this is a better book. 




Part I 


Fundamentals 


CHAPTER 1 


Sets 


All of mathematics can be described with sets. This becomes more and
more apparent the deeper into mathematics you go. It will be apparent
in most of your upper level courses, and certainly in this course. The theory
of sets is a language that is perfectly suited to describing and explaining all
types of mathematical structures.


1.1 Introduction to Sets 


A set is a collection of things. The things are called elements of the set. We 
are mainly concerned with sets whose elements are mathematical entities, 
such as numbers, points, functions, etc. 

A set is often expressed by listing its elements between commas, enclosed 
by braces. For example, the collection {2,4,6,8} is a set which has four 
elements, the numbers 2,4,6 and 8. Some sets have infinitely many elements. 
For example, consider the collection of all integers, 


{..., -4, -3, -2, -1, 0, 1, 2, 3, 4, ...}.


Here the dots indicate a pattern of numbers that continues forever in both 
the positive and negative directions. A set is called an infinite set if it has 
infinitely many elements; otherwise it is called a finite set. 

Two sets are equal if they contain exactly the same elements. Thus
{2,4,6,8} = {4,2,8,6} because even though they are listed in a different order,
the elements are identical; but {2,4,6,8} ≠ {2,4,6,7}. Also


{..., -4, -3, -2, -1, 0, 1, 2, 3, 4, ...} = {0, -1, 1, -2, 2, -3, 3, -4, 4, ...}.


We often let uppercase letters stand for sets. In discussing the set 
{2,4,6,8} we might declare A = {2,4,6,8} and then use A to stand for {2,4,6,8}. 
To express that 2 is an element of the set A, we write 2 ∈ A, and read this as
“2 is an element of A,” or “2 is in A,” or just “2 in A.” We also have 4 ∈ A, 6 ∈ A
and 8 ∈ A, but 5 ∉ A. We read this last expression as “5 is not an element of
A,” or “5 not in A.” Expressions like 6,2 ∈ A or 2,4,8 ∈ A are used to indicate
that several things are in a set.
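
As an informal aside, Python's built-in set type follows the same conventions, which makes it a handy way to experiment. The sketch below is only an illustration (the variable names are chosen here, not taken from the text): equality ignores the order of listing, and the `in` operator plays the role of ∈.

```python
# Sets are equal when they contain exactly the same elements;
# the order in which the elements are listed is irrelevant.
A = {2, 4, 6, 8}
same_elements = ({2, 4, 6, 8} == {4, 2, 8, 6})    # True
different_sets = ({2, 4, 6, 8} == {2, 4, 6, 7})   # False

# Membership: `x in A` corresponds to x ∈ A, `x not in A` to x ∉ A.
two_in_A = 2 in A             # True
five_not_in_A = 5 not in A    # True

# "2,4,8 ∈ A" abbreviates three separate membership claims.
several_in_A = all(x in A for x in (2, 4, 8))     # True
```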




Some sets are so significant that we reserve special symbols for them. 
The set of natural numbers (i.e., the positive whole numbers) is denoted 
by N, that is, 

N= {1,2,3,4,5,6,7,...}. 


The set of integers

Z = {..., -3, -2, -1, 0, 1, 2, 3, 4, ...}


is another fundamental set. The symbol R stands for the set of all real 
numbers, a set that is undoubtedly familiar to you from calculus. Other 
special sets will be listed later in this section. 

Sets need not have just numbers as elements. The set B = {T, F} consists 
of two letters, perhaps representing the values “true” and “false.” The set 
C = {a,e,i,o,u} consists of the lowercase vowels in the English alphabet. 
The set D = {(0,0),(1,0),(0, 1),(1, 1)} has as elements the four corner points 
of a square on the x-y coordinate plane. Thus (0,0) ∈ D, (1,0) ∈ D, etc.,
but (1,2) ∉ D (for instance). It is even possible for a set to have other sets
as elements. Consider E = {1, {2,3}, {2,4}}, which has three elements: the
number 1, the set {2,3} and the set {2,4}. Thus 1 ∈ E and {2,3} ∈ E and
{2,4} ∈ E. But note that 2 ∉ E, 3 ∉ E and 4 ∉ E.

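Consider the set M = {[0 0; 0 0], [1 0; 0 1], [1 0; 1 1]} of three two-by-two
matrices, each written row by row with a semicolon between the rows. We
have [0 0; 0 0] ∈ M, but [1 1; 0 1] ∉ M. Letters can serve as symbols denoting a set’s
elements: If a = [0 0; 0 0], b = [1 0; 0 1] and c = [1 0; 1 1], then M = {a,b,c}.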

If X is a finite set, its cardinality or size is the number of elements 
it has, and this number is denoted as |X|. Thus for the sets above, |A| = 4, 
|B| = 2, |C| =5, |D| =4, |E| =3 and |M|=3. 

There is a special set that, although small, plays a big role. The empty
set is the set {} that has no elements. We denote it as ∅, so ∅ = {}. Whenever
you see the symbol ∅, it stands for {}. Observe that |∅| = 0. The empty set
is the only set whose cardinality is zero.

Be careful in writing the empty set. Don’t write {∅} when you mean ∅.
These sets can’t be equal because ∅ contains nothing while {∅} contains
one thing, namely the empty set. If this is confusing, think of a set as a
box with things in it, so, for example, {2,4,6,8} is a “box” containing four
numbers. The empty set ∅ = {} is an empty box. By contrast, {∅} is a box
with an empty box inside it. Obviously, there’s a difference: An empty box
is not the same as a box with an empty box inside it. Thus ∅ ≠ {∅}. (You
might also note |∅| = 0 and |{∅}| = 1 as additional evidence that ∅ ≠ {∅}.)
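
The box picture can be tried out directly. The following is a small sketch, not part of the original exposition; it assumes Python's `frozenset` as a stand-in for a set that is itself an element of another set (Python requires set elements to be immutable).

```python
# frozenset is used because Python sets can only contain hashable
# (immutable) objects, so a set that is itself an element must be frozen.
empty = frozenset()          # ∅, the empty box
box_with_empty = {empty}     # {∅}, a box holding an empty box

size_empty = len(empty)                  # |∅| = 0
size_box = len(box_with_empty)           # |{∅}| = 1
are_equal = (empty == box_with_empty)    # False, since ∅ ≠ {∅}
```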




This box analogy can help us think about sets. The set F = {∅, {∅}, {{∅}}}
may look strange but it is really very simple. Think of it as a box containing 
three things: an empty box, a box containing an empty box, and a box 
containing a box containing an empty box. Thus |F| = 3. The set G = {N, Z} 
is a box containing two boxes, the box of natural numbers and the box of 
integers. Thus |G| = 2. 

A special notation called set-builder notation is used to describe sets 
that are too big or complex to list between braces. Consider the infinite set 
of even integers E = {..., -6, -4, -2, 0, 2, 4, 6, ...}. In set-builder notation this
set is written as

E = {2n : n ∈ Z}.


We read the first brace as “the set of all things of form,” and the colon as
“such that.” So the expression E = {2n : n ∈ Z} reads as “E equals the set of
all things of form 2n, such that n is an element of Z.” The idea is that E
consists of all possible values of 2n, where n takes on all values in Z.

In general, a set X written with set-builder notation has the syntax 


X = {expression : rule}, 


where the elements of X are understood to be all values of “expression” that
are specified by “rule.” For example, above E is the set of all values of the
expression 2n that satisfy the rule n ∈ Z. There can be many ways to express
the same set. For example, E = {2n : n ∈ Z} = {n : n is an even integer} =
{n : n = 2k, k ∈ Z}. Another common way of writing it is


E = {n ∈ Z : n is even},


read “E is the set of all n in Z such that n is even.” Some writers use a bar
instead of a colon; for example, E = {n ∈ Z | n is even}. We use the colon.


Example 1.1 Here are some further illustrations of set-builder notation.

1. {n : n is a prime number} = {2,3,5,7,11,13,17,...}
2. {n ∈ N : n is prime} = {2,3,5,7,11,13,17,...}
3. {n² : n ∈ Z} = {0,1,4,9,16,25,...}
4. {x ∈ R : x² - 2 = 0} = {√2, -√2}
5. {x ∈ Z : x² - 2 = 0} = ∅
6. {x ∈ Z : |x| < 4} = {-3,-2,-1,0,1,2,3}
7. {2x : x ∈ Z, |x| < 4} = {-6,-4,-2,0,2,4,6}
8. {x ∈ Z : |2x| < 4} = {-1,0,1}
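
Set-builder notation has a close cousin in Python's set comprehensions, where the "expression" comes first and the "rule" follows. The sketch below reproduces items 5 through 8 of Example 1.1 under one stated assumption: because Z is infinite, a finite window of integers (here -10 to 10, an arbitrary choice) stands in for it, which is large enough for these particular rules.

```python
# A finite window of integers stands in for Z, which cannot be listed in full.
Z_window = range(-10, 11)

item5 = {x for x in Z_window if x * x - 2 == 0}    # {x ∈ Z : x² - 2 = 0}
item6 = {x for x in Z_window if abs(x) < 4}        # {x ∈ Z : |x| < 4}
item7 = {2 * x for x in Z_window if abs(x) < 4}    # {2x : x ∈ Z, |x| < 4}
item8 = {x for x in Z_window if abs(2 * x) < 4}    # {x ∈ Z : |2x| < 4}
```

Item 5 comes out empty inside the window, matching the example; the code only checks the window, while the claim for all of Z rests on the fact that no integer squares to 2.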




Items 6-8 above highlight a conflict of notation that we must always
be alert to. The expression |X| means absolute value if X is a number
and cardinality if X is a set. The distinction should always be clear from
context. Consider {x ∈ Z : |x| < 4} in Example 1.1 (6) above. Here x ∈ Z, so x
is a number (not a set), and thus the bars in |x| must mean absolute value,
not cardinality. On the other hand, suppose A = {{1,2}, {3,4,5,6}, {7}} and
B = {X ∈ A : |X| < 3}. The elements of A are sets (not numbers), so the |X| in
the expression for B must mean cardinality. Therefore B = {{1,2}, {7}}.
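
In a programming language the two meanings of the bars are spelled differently, which can make the distinction concrete. This is a hedged sketch only: `abs` is assumed to model absolute value, `len` to model cardinality, and `frozenset` to model sets that are elements of another set.

```python
# The elements of A are sets, so |X| means cardinality: len(X) in Python.
A = {frozenset({1, 2}), frozenset({3, 4, 5, 6}), frozenset({7})}
B = {X for X in A if len(X) < 3}      # {X ∈ A : |X| < 3}

# For a number, |x| means absolute value: abs(x) in Python.
absolute_value = abs(-3)              # 3
```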


Example 1.2 Describe the set A = {7a + 3b : a,b ∈ Z}.

Solution: This set contains all numbers of form 7a + 3b, where a and b
are integers. Each such number 7a + 3b is an integer, so A contains only
integers. But which integers? If n is any integer, then n = 7n + 3(-2n), so
n = 7a + 3b where a = n and b = -2n. Therefore n ∈ A. We’ve now shown that
A contains only integers, and also that every integer is an element of A.
Consequently A = Z.
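
The key step of the solution, that every integer n equals 7a + 3b for a = n and b = -2n, can be spot-checked numerically. The helper name `witness` and the test range below are assumptions made for this sketch; a finite check illustrates the identity but does not replace the argument above.

```python
def witness(n):
    """Return the pair (a, b) used in Example 1.2, so that 7a + 3b = n."""
    return n, -2 * n

def reaches(n):
    """Check that the chosen pair really produces n."""
    a, b = witness(n)
    return 7 * a + 3 * b == n

# Spot-check the identity on a finite range of integers.
all_reached = all(reaches(n) for n in range(-50, 51))   # True
```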


We close this section with a summary of special sets. These are sets 
that are so common that they are given special names and symbols. 
• The empty set: ∅ = {}
• The natural numbers: N = {1,2,3,4,5,...}
• The integers: Z = {...,-3,-2,-1,0,1,2,3,4,5,...}
• The rational numbers: Q = {x : x = m/n, where m,n ∈ Z and n ≠ 0}
• The real numbers: R


We visualize the set R of real numbers as an infinitely long number line. 


[Figure: a number line with tick marks at -4, -3, -2, -1, 0, 1, 2, 3, 4]


Notice that Q is the set of all numbers in R that can be expressed as a
fraction of two integers. You may be aware that Q ≠ R, as √2 ∉ Q but √2 ∈ R.
(If not, this point will be addressed in Chapter 6.)

In calculus you encountered intervals on the number line. Like R, these
too are infinite sets of numbers. Any two numbers a,b ∈ R with a < b give
rise to various intervals. Graphically, they are represented by a darkened
segment on the number line between a and b. A solid circle at an endpoint
indicates that that number is included in the interval. A hollow circle
indicates a point that is not included in the interval.




• Closed interval: [a,b] = {x ∈ R : a ≤ x ≤ b}
• Open interval: (a,b) = {x ∈ R : a < x < b}
• Half-open interval: (a,b] = {x ∈ R : a < x ≤ b}
• Half-open interval: [a,b) = {x ∈ R : a ≤ x < b}
• Infinite interval: (a,∞) = {x ∈ R : a < x}
• Infinite interval: [a,∞) = {x ∈ R : a ≤ x}
• Infinite interval: (-∞,b) = {x ∈ R : x < b}
• Infinite interval: (-∞,b] = {x ∈ R : x ≤ b}


Each of these intervals is an infinite set containing infinitely many 
numbers as elements. For example, though its length is short, the interval 
(0.1,0.2) contains infinitely many numbers, that is, all numbers between 
0.1 and 0.2. It is an unfortunate notational accident that (a,b) can denote 
both an open interval on the line and a point on the plane. The difference 
is usually clear from context. In the next section we will see yet another 
meaning of (a,b). 


Exercises for Section 1.1

A. Write each of the following sets by listing their elements between braces.

1. {5x-1 : x ∈ Z}
2. {3x+2 : x ∈ Z}
3. {x ∈ Z : -2 ≤ x < 7}
4. {x ∈ N : -2 < x ≤ 7}
5. {x ∈ R : x² = 3}
6. {x ∈ R : x² = 9}
7. {x ∈ R : x² + 5x = -6}
8. {x ∈ R : x³ + 5x² = -6x}
9. {x ∈ R : sin πx = 0}
10. {x ∈ R : cos x = 1}
11. {x ∈ Z : |x| < 5}
12. {x ∈ Z : |2x| < 5}
13. {x ∈ Z : |6x| < 5}
14. {5x : x ∈ Z, |2x| ≤ 8}
15. {5a + 2b : a,b ∈ Z}
16. {6a + 2b : a,b ∈ Z}


B. Write each of the following sets in set-builder notation.

17. {2,4,8,16,32,64,...}
18. {0,4,16,36,64,100,...}
19. {...,-6,-3,0,3,6,9,12,15,...}
20. {...,-8,-3,2,7,12,17,...}
21. {0,1,4,9,16,25,36,...}
22. {3,6,11,18,27,38,...}
23. {3,4,5,6,7,8}
24. {-4,-3,-2,-1,0,1,2}
25. {..., 1/8, 1/4, 1/2, 1, 2, 4, 8, ...}
26. {..., 1/27, 1/9, 1/3, 1, 3, 9, 27, ...}
27. {..., -π, -π/2, 0, π/2, π, 3π/2, 2π, 5π/2, ...}
28. {..., -3/2, -3/4, 0, 3/4, 3/2, 9/4, 3, 15/4, 9/2, ...}




C. Find the following cardinalities.

29. |{{1}, {2,{3,4}}, ∅}|
30. |{{1,4}, a, b, {{3,4}}, {∅}}|
31. |{{{1}, {2,{3,4}}, ∅}}|
32. |{{{1,4}, a, b, {{3,4}}, {∅}}}|
33. |{x ∈ Z : |x| < 10}|
34. |{x ∈ N : |x| < 10}|
35. |{x ∈ Z : x² < 10}|
36. |{x ∈ N : x² < 10}|
37. |{x ∈ N : x² < 0}|
38. |{x ∈ N : 5x ≤ 20}|

D. Sketch the following sets of points in the x-y plane.

39. {(x,y) : x ∈ [1,2], y ∈ [1,2]}
40. {(x,y) : x ∈ [0,1], y ∈ [1,2]}
41. {(x,y) : x ∈ [-1,1], y = 1}
42. {(x,y) : x = 2, y ∈ [0,1]}
43. {(x,y) : |x| = 2, y ∈ [0,1]}
44. {(x,x²) : x ∈ R}
45. {(x,y) : x,y ∈ R, x² + y² = 1}
46. {(x,y) : x,y ∈ R, x² + y² ≤ 1}
47. {(x,y) : x,y ∈ R, y ≥ x² - 1}
48. {(x,y) : x,y ∈ R, x > 1}
49. {(x,x+y) : x ∈ R, y ∈ Z}
50. {(x, x²/y) : x ∈ R, y ∈ N}
51. {(x,y) ∈ R² : (y-x)(y+x) = 0}
52. {(x,y) ∈ R² : (y-x²)(y+x²) = 0}


1.2 The Cartesian Product 


Given two sets A and B, it is possible to “multiply” them to produce a new 
set denoted as A x B. This operation is called the Cartesian product. To 
understand it, we must first understand the idea of an ordered pair. 


Definition 1.1 An ordered pair is a list (x,y) of two things x and y, 
enclosed in parentheses and separated by a comma. 


For example, (2,4) is an ordered pair, as is (4,2). These ordered pairs are
different because even though they have the same things in them, the order
is different. We write (2,4) ≠ (4,2). Right away you can see that ordered pairs
can be used to describe points on the plane, as was done in calculus, but
they are not limited to just that. The things in an ordered pair don’t have
to be numbers. You can have ordered pairs of letters, such as (ℓ,m), ordered
pairs of sets such as ({2,5}, {3,2}), even ordered pairs of ordered pairs like
((2,4),(4,2)). The following are also ordered pairs: (2, {1,2,3}), (R,(0,0)). Any
list of two things enclosed by parentheses is an ordered pair. Now we are
ready to define the Cartesian product.


Definition 1.2 The Cartesian product of two sets A and B is another
set, denoted as A × B and defined as A × B = {(a,b) : a ∈ A, b ∈ B}.




Thus A × B is a set of ordered pairs of elements from A and B. For
example, if A = {k,ℓ,m} and B = {q,r}, then


A × B = {(k,q),(k,r),(ℓ,q),(ℓ,r),(m,q),(m,r)}.


Figure 1.1 shows how to make a schematic diagram of A x B. Line up the 
elements of A horizontally and line up the elements of B vertically, as if 
A and B form an x- and y-axis. Then fill in the ordered pairs so that each 
element (x,y) is in the column headed by x and the row headed by y. 


B                  A × B
r     (k,r)  (ℓ,r)  (m,r)
q     (k,q)  (ℓ,q)  (m,q)
        k      ℓ      m      A


Figure 1.1. A diagram of a Cartesian product 


For another example, {0,1} × {2,1} = {(0,2),(0,1),(1,2),(1,1)}. If you are a
visual thinker, you may wish to draw a diagram similar to Figure 1.1. The
rectangular array in such diagrams gives us the following general fact.


Fact 1.1 If A and B are finite sets, then |A × B| = |A| · |B|.
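
Fact 1.1 is easy to check by machine on small examples. The sketch below uses `itertools.product` from Python's standard library, which yields the ordered pairs (a,b) as tuples; the string labels (with "l" standing in for the script letter) and the variable names are illustrative choices, not part of the text.

```python
from itertools import product

A = {"k", "l", "m"}
B = {"q", "r"}

# A × B: all ordered pairs (a, b) with a in A and b in B.
AxB = set(product(A, B))

# Fact 1.1: |A × B| = |A| · |B|, here 6 = 3 · 2.
size_matches = len(AxB) == len(A) * len(B)

# Order matters in an ordered pair: (k, q) is in A × B, but (q, k) is not.
kq_in = ("k", "q") in AxB
qk_in = ("q", "k") in AxB
```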


Example 1.3 Let A = {⚀,⚁,⚂,⚃,⚄,⚅} be the set consisting of the six faces
of a die. The Cartesian product A × A is diagramed below. By Fact 1.1 (or
by simple counting), |A × A| = 6 · 6 = 36. We might think of A × A as the set
of possible outcomes in rolling a die two times in a row. Each element of
the product is an ordered pair of form (result of 1st roll, result of 2nd roll).
Such constructions are useful in the study of probability.


[Figure: the 36 ordered pairs of A × A arranged in a 6 × 6 grid]




The set R × R = {(x,y) : x,y ∈ R} should be very familiar. It can be viewed
as the set of points on the Cartesian plane, as drawn in Figure 1.2(a). The
set R × N = {(x,y) : x ∈ R, y ∈ N} can be regarded as all of the points on the
plane whose second coordinate is a natural number. This is illustrated in
Figure 1.2(b), which shows that R × N looks like infinitely many horizontal
lines at integer heights above the x-axis. The set N × N is the set of all points
on the plane whose coordinates are both natural numbers. It looks like a
grid of dots in the first quadrant, as illustrated in Figure 1.2(c).


[Figure panels: (a) R × R, (b) R × N, (c) N × N]


Figure 1.2. Drawings of some Cartesian products 


It is even possible for one factor of a Cartesian product to be a Cartesian
product itself, as in R × (N × Z) = {(x,(y,z)) : x ∈ R, (y,z) ∈ N × Z}.

We can also define Cartesian products of three or more sets by moving
beyond ordered pairs. An ordered triple is a list (x,y,z). The Cartesian
product of the three sets R, N and Z is R × N × Z = {(x,y,z) : x ∈ R, y ∈ N, z ∈ Z}.
Of course there is no reason to stop with ordered triples. In general,


A₁ × A₂ × ··· × Aₙ = {(x₁,x₂,...,xₙ) : xᵢ ∈ Aᵢ for each i = 1,2,...,n}.


Be mindful of parentheses. There is a slight difference between R × (N × Z)
and R × N × Z. The first is a Cartesian product of two sets; its elements are
ordered pairs (x,(y,z)). The second is a Cartesian product of three sets; its
elements are ordered triples (x,y,z). To be sure, in many situations there is
no harm in blurring the distinction between expressions like (x,(y,z)) and
(x,y,z), but for now we regard them as different.
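
The difference between nested pairs and triples shows up clearly when the products are built explicitly. In this sketch, small finite sets (named `R_part`, `N_part` and `Z_part` here purely for illustration) stand in for R, N and Z, since the real sets are infinite.

```python
from itertools import product

# Small finite stand-ins for R, N and Z (the real sets are infinite).
R_part = {0.5}
N_part = {1, 2}
Z_part = {-1, 0}

# R × (N × Z): ordered pairs whose second entry is itself an ordered pair.
nested = set(product(R_part, product(N_part, Z_part)))

# R × N × Z: ordered triples.
flat = set(product(R_part, N_part, Z_part))

pair_in_nested = (0.5, (1, 0)) in nested    # True
triple_in_flat = (0.5, 1, 0) in flat        # True
same_size = len(nested) == len(flat)        # True: both have 4 elements
same_set = nested == flat                   # False: pairs are not triples
```

The two sets have the same number of elements, yet they are not equal, which is exactly the distinction drawn above.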

For any set A and positive integer n, the Cartesian power Aⁿ is


Aⁿ = A × A × ··· × A = {(x₁,x₂,...,xₙ) : x₁,x₂,...,xₙ ∈ A}.


In this way, R² is the familiar Cartesian plane and R³ is three-dimensional
space. You can visualize how, if R² is the plane, then Z² = {(m,n) : m,n ∈ Z}
is a grid of points on the plane. Likewise, as R³ is 3-dimensional space,
Z³ = {(m,n,p) : m,n,p ∈ Z} is a grid of points in space.




In other courses you may encounter sets that are very similar to Rⁿ, but
yet have slightly different shades of meaning. Consider, for example, the
set of all two-by-three matrices with entries from R:

M = {[u v w; x y z] : u,v,w,x,y,z ∈ R}.

This is not really all that different from the set

R⁶ = {(u,v,w,x,y,z) : u,v,w,x,y,z ∈ R}.


The elements of these sets are merely certain arrangements of six real
numbers. Despite their similarity, we maintain that M ≠ R⁶, for two-by-three
matrices are not the same things as sequences of six numbers.


Example 1.4 Represent the two sides of a coin by the set S = {H,T}. The
possible outcomes of tossing the coin seven times in a row can be described
with the Cartesian power S⁷. A typical element of S⁷ looks like


(H,H,T,H,T,T,T),


meaning a head was tossed first, then another head, then a tail, then a head
followed by three tails. Note that |S⁷| = 2⁷ = 128, so there are 128 possible
outcomes. If this is not clear, then it will be explained fully in Chapter 3.
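
For readers who like to confirm a count by brute force, the Cartesian power S⁷ is small enough to list in full. The sketch below relies on the `repeat` argument of `itertools.product`; the variable names are chosen for this illustration.

```python
from itertools import product

S = ("H", "T")

# S⁷ = S × S × ··· × S (seven factors): all sequences of seven tosses.
S7 = list(product(S, repeat=7))

count = len(S7)    # 2⁷ = 128 possible outcomes
typical = ("H", "H", "T", "H", "T", "T", "T") in S7    # True
```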


Exercises for Section 1.2


A. Write out the indicated sets by listing their elements between braces.

1. Suppose A = {1,2,3,4} and B = {a,c}.
(a) A × B
(b) B × A
(c) A × A
(d) B × B
(e) ∅ × B
(f) (A × B) × B
(g) A × (B × B)
(h) B³

2. Suppose A = {π,e,0} and B = {0,1}.
(a) A × B
(b) B × A
(c) A × A
(d) B × B
(e) A × ∅
(f) (A × B) × B
(g) A × (B × B)
(h) A × B × B

3. {x ∈ R : x² = 2} × {a,c,e}
4. {n ∈ Z : 2 < n < 5} × {n ∈ Z : |n| = 5}
5. {x ∈ R : x² = 2} × {x ∈ R : |x| = 2}
6. {x ∈ R : x² = x} × {x ∈ N : x² = x}
7. {∅} × {0,∅} × {0,1}
8. {0,1}⁴

B. Sketch these Cartesian products on the x-y plane R² (or R³ for the last two).

9. {1,2,3} × {-1,0,1}
10. {-1,0,1} × {1,2,3}
11. [0,1] × [0,1]
12. [-1,1] × [1,2]
13. {1,1.5,2} × [1,2]
14. [1,2] × {1,1.5,2}
15. {1} × [0,1]
16. [0,1] × {1}
17. N × Z
18. Z × Z
19. [0,1] × [0,1] × [0,1]
20. {(x,y) ∈ R² : x² + y² ≤ 1} × [0,1]




1.3 Subsets 


It can happen that every element of a set A is an element of another set B. 
For example, each element of A = {0,2,4} is also an element of B = {0,1,2,3, 4}. 
When A and B are related this way we say that A is a subset of B. 


Definition 1.3 Suppose A and B are sets. If every element of A is also
an element of B, then we say A is a subset of B, and we denote this as
A ⊆ B. We write A ⊈ B if A is not a subset of B, that is, if it is not true that
every element of A is also an element of B. Thus A ⊈ B means that there
is at least one element of A that is not an element of B.


Example 1.5 Be sure you understand why each of the following is true.

1. {2,3,7} ⊆ {2,3,4,5,6,7}
2. {2,3,7} ⊈ {2,4,5,6,7}
3. {2,3,7} ⊆ {2,3,7}
4. {(x,sin(x)) : x ∈ R} ⊆ R²
5. {1,3,5,7,11,13,17,...} ⊆ N
6. N ⊆ Z ⊆ Q ⊆ R
7. R × N ⊆ R × R
8. A ⊆ A for any set A
9. ∅ ⊆ ∅.


This brings us to a significant fact: If B is any set whatsoever, then ∅ ⊆ B.
To see why this is true, look at the last sentence of Definition 1.3. It says
that ∅ ⊈ B would mean that there is at least one element of ∅ that is not an
element of B. But this cannot be so because ∅ contains no elements! Thus
it is not the case that ∅ ⊈ B, so it must be that ∅ ⊆ B.


Fact 1.2 The empty set is a subset of all sets, that is, ∅ ⊆ B for any set B.


Here is another way to look at it. Imagine a subset of B as a thing you 
make by starting with braces {}, then filling them with selections from B. 
For example, to make one particular subset of B = {a,b,c}, start with {}, 
select b and c from B and insert them into {} to form the subset {b,c}. 
Alternatively, you could have chosen just a to make {a}, and so on. But one 
option is to simply select nothing from B. This leaves you with the subset {}. 
Thus {} ⊆ B. More often we write it as ∅ ⊆ B.




This idea of “making” a subset can help us list out all the subsets of a 
given set B. As an example, let B = {a,b,c}. Let’s list all of its subsets. One 
way of approaching this is to make a tree-like structure. Begin with the 
subset {}, which is shown on the left of Figure 1.3. Considering the element 
a of B, we have a choice: insert it into {}, or not. The lines from {} point to 
what we get depending whether or not we insert a, either {} or {a}. Now 
move on to the element b of B. For each of the sets just formed we can either 
insert or not insert b, and the lines on the diagram point to the resulting 
sets {}, {b},{a}, or {a,b}. Finally, to each of these sets, we can either insert 
c or not insert it, and this gives us, on the far right-hand column, the sets 
{}, {c}, {b}, {b,c}, {a}, {a,c}, {a,b} and {a,b,c}. These are the eight subsets
of B = {a,b,c}. 


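[Figure: a tree that starts at {} on the left and branches three times, under the column headings “Insert a?”, “Insert b?” and “Insert c?”. Each “No” branch keeps the current set and each “Yes” branch inserts the element, giving the columns {}, {a}; then {}, {b}, {a}, {a,b}; and finally the eight subsets of B.]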


Figure 1.3. A “tree” for listing subsets 


We can see from the way this tree branches that if it happened that 
B = {a}, then B would have just two subsets, those in the second column of 
the diagram. If it happened that B = {a,b}, then B would have four subsets, 
those in the third column, and so on. At each branching of the tree, the 
number of subsets doubles. So in general, if |B| = n, then B has 2ⁿ subsets.


Fact 1.3 If a finite set has n elements, then it has 2ⁿ subsets.
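
The doubling seen in the tree can be imitated in a few lines of Python. The sketch below is only an illustration: the function name subsets_by_tree is our own, and subsets are modeled as Python lists so that the order of insertion stays visible.

```python
def subsets_by_tree(elements):
    """List all subsets of `elements` by the insert-or-not process of the tree."""
    column = [[]]                        # leftmost column: only the empty subset {}
    for x in elements:                   # one branching stage per element
        next_column = []
        for s in column:
            next_column.append(s)        # "No" branch: do not insert x
            next_column.append(s + [x])  # "Yes" branch: insert x
        column = next_column             # each stage doubles the number of sets
    return column


subsets = subsets_by_tree(["a", "b", "c"])
print(subsets)
print(len(subsets) == 2 ** 3)   # True
```

The final list comes out in the same order as the right-hand column of the tree: {}, {c}, {b}, {b,c}, {a}, {a,c}, {a,b}, {a,b,c}.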




For a slightly more complex example, consider listing the subsets of 
B = {1,2,{1,3}}. This B has just three elements: 1, 2 and {1,3}. At this point
you probably don’t even have to draw a tree to list out B’s subsets. You just 
make all the possible selections from B and put them between braces to get 


{}, {1}, {2}, {{1,3}}, {1,2}, {1,{1,3}}, {2,{1,3}}, {1,2,{1,3}}.


These are the eight subsets of B. Exercises like this help you identify what 
is and isn’t a subset. You know immediately that a set such as {1,3} is not a 
subset of B because it can’t be made by inserting elements from B into {}, 
as the 3 is not an element of B and thus is not a valid selection. Notice that 
although {1,3} ⊈ B, it is true that {1,3} ∈ B. Also, {{1,3}} ⊆ B.


Example 1.6 Be sure you understand why the following statements are 
true. Each illustrates an aspect of set theory that you’ve learned so far. 


1. 1 ∈ {1,{1}} .......... 1 is the first element listed in {1,{1}}
2. 1 ⊈ {1,{1}} .......... because 1 is not a set
3. {1} ∈ {1,{1}} .......... {1} is the second element listed in {1,{1}}
4. {1} ⊆ {1,{1}} .......... make subset {1} by selecting 1 from {1,{1}}
5. {{1}} ∉ {1,{1}} .......... because {1,{1}} contains only 1 and {1}, and not {{1}}
6. {{1}} ⊆ {1,{1}} .......... make subset {{1}} by selecting {1} from {1,{1}}
7. N ∉ N .......... N is a set (not a number) and N contains only numbers
8. N ⊆ N .......... because X ⊆ X for every set X
9. ∅ ∉ N .......... because the set N contains only numbers and no sets
10. ∅ ⊆ N .......... because ∅ is a subset of every set
11. N ∈ {N} .......... because {N} has just one element, the set N
12. N ⊈ {N} .......... because, for instance, 1 ∈ N but 1 ∉ {N}
13. ∅ ∉ {N} .......... note that the only element of {N} is N, and N ≠ ∅
14. ∅ ⊆ {N} .......... because ∅ is a subset of every set
15. ∅ ∈ {∅,N} .......... ∅ is the first element listed in {∅,N}
16. ∅ ⊆ {∅,N} .......... because ∅ is a subset of every set
17. {N} ⊆ {∅,N} .......... make subset {N} by selecting N from {∅,N}
18. {N} ⊈ {∅,{N}} .......... because N ∉ {∅,{N}}
19. {N} ∈ {∅,{N}} .......... {N} is the second element listed in {∅,{N}}
20. {(1,2),(2,2),(7,1)} ⊆ N × N .......... each of (1,2), (2,2), (7,1) is in N × N
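
The first six statements can be checked mechanically. The following Python sketch is an illustration under one modeling assumption: sets that appear as elements of other sets are represented by frozenset, because Python's ordinary set type cannot be an element of another set. The names inner and S are our own.

```python
inner = frozenset({1})            # the set {1}
S = frozenset({1, inner})         # the set {1, {1}}

print(1 in S)                     # statement 1: 1 ∈ {1,{1}}
print(inner in S)                 # statement 3: {1} ∈ {1,{1}}
print(inner <= S)                 # statement 4: {1} ⊆ {1,{1}}
print(frozenset({inner}) in S)    # statement 5: False, since {{1}} ∉ {1,{1}}
print(frozenset({inner}) <= S)    # statement 6: {{1}} ⊆ {1,{1}}

# Statement 2: asking whether the number 1 is a subset is not even meaningful.
try:
    1 <= S
except TypeError:
    print("1 is not a set, so it cannot be a subset")
```

Here `in` plays the role of ∈ and `<=` plays the role of ⊆, which makes the difference between being an element and being a subset explicit.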


Though they should help you understand the concept of subset, the 
above examples are somewhat artificial. But in general, subsets arise very 
naturally. For instance, consider the unit circle C = {(x,y) ∈ R² : x² + y² = 1}.




This is a subset C ⊆ R². Likewise the graph of a function y = f(x) is a set
of points G = {(x, f(x)) : x ∈ R}, and G ⊆ R². Surely sets such as C and G
are more easily understood or visualized when regarded as subsets of R².
Mathematics is filled with such instances where it is important to regard 
one set as a subset of another. 


Exercises for Section 1.3 
A. List all the subsets of the following sets. 


1. {1,2,3,4}
2. {1,2,∅}
3. {{R}}
4. ∅
5. {∅}
6. {R,Q,N}
7. {R,{Q,N}}
8. {{0,1},{0,1,{2}},{0}}
B. Write out the following sets by listing their elements between braces.
9. {X : X ⊆ {3,2,a} and |X| = 2}
10. {X ⊆ N : |X| ≤ 1}
11. {X : X ⊆ {3,2,a} and |X| = 4}
12. {X : X ⊆ {3,2,a} and |X| = 1}
C. Decide if the following statements are true or false. Explain. 
13. R³ ⊆ R³
14. R² ⊆ R³
15. {(x,y) ∈ R² : x − 1 = 0} ⊆ {(x,y) ∈ R² : x² − x = 0}
16. {(x,y) ∈ R² : x² − x = 0} ⊆ {(x,y) ∈ R² : x − 1 = 0}


1.4 Power Sets 


Given a set, you can form a new set with the power set operation. 


Definition 1.4 If A is a set, the power set of A is another set, denoted
as 𝒫(A) and defined to be the set of all subsets of A. In symbols, 𝒫(A) =
{X : X ⊆ A}.


For example, suppose A = {1,2,3}. The power set of A is the set of all 
subsets of A. We learned how to find these subsets in the previous section, 
and they are {}, {1}, {2}, {3}, {1,2}, {1,3}, {2,3} and {1,2,3}. Therefore the 
power set of A is 


𝒫(A) = { ∅, {1}, {2}, {3}, {1,2}, {1,3}, {2,3}, {1,2,3} }.


As we saw in the previous section, if a finite set A has n elements, then
it has 2ⁿ subsets, and thus its power set has 2ⁿ elements.


Fact 1.4 If A is a finite set, then |𝒫(A)| = 2^|A|.
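
For small finite sets, Fact 1.4 can be checked by direct computation. The sketch below is one possible way to do this in Python; the function name power_set is our own, and each subset is stored as a frozenset so that subsets can themselves be elements of a set.

```python
from itertools import combinations


def power_set(A):
    """Return the power set of the finite set A as a set of frozensets."""
    items = list(A)
    return {frozenset(c)
            for k in range(len(items) + 1)     # subsets of size 0, 1, ..., |A|
            for c in combinations(items, k)}


A = {1, 2, 3}
P = power_set(A)
print(len(P) == 2 ** len(A))              # True: |P(A)| = 2^|A|
print(power_set(set()) == {frozenset()})  # True: the power set of ∅ is {∅}
print(len(power_set(power_set(set()))))   # 2
```

Because the result is again a set, power_set can be applied repeatedly, which mirrors expressions such as 𝒫(𝒫(∅)).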




Example 1.7 You should examine the following statements and make 
sure you understand how the answers were obtained. In particular, notice 
that in each instance the equation |𝒫(A)| = 2^|A| is true.


1. 𝒫({0,1,3}) = { ∅, {0}, {1}, {3}, {0,1}, {0,3}, {1,3}, {0,1,3} }
2. 𝒫({1,2}) = { ∅, {1}, {2}, {1,2} }
3. 𝒫({1}) = { ∅, {1} }
4. 𝒫(∅) = { ∅ }
5. 𝒫({a}) = { ∅, {a} }
6. 𝒫({∅}) = { ∅, {∅} }
7. 𝒫({a}) × 𝒫({∅}) = { (∅,∅), (∅,{∅}), ({a},∅), ({a},{∅}) }
8. 𝒫(𝒫({∅})) = { ∅, {∅}, {{∅}}, {∅,{∅}} }
9. 𝒫({1,{1,2}}) = { ∅, {1}, {{1,2}}, {1,{1,2}} }
10. 𝒫({Z,N}) = { ∅, {Z}, {N}, {Z,N} }


Next are some that are wrong. See if you can determine why they are 
wrong and make sure you understand the explanation on the right. 


11. 𝒫(1) = { ∅, {1} } .......... meaningless because 1 is not a set
12. 𝒫({1,{1,2}}) = { ∅, {1}, {1,2}, {1,{1,2}} } .......... wrong because {1,2} ⊈ {1,{1,2}}
13. 𝒫({1,{1,2}}) = { ∅, {{1}}, {{1,2}}, {1,{1,2}} } .......... wrong because {{1}} ⊈ {1,{1,2}}


If A is finite, then it is possible (though maybe not practical) to list out
𝒫(A) between braces as was done in the above example. That is not possible
if A is infinite. For example, consider 𝒫(N). If you start listing its elements
you quickly discover that N has infinitely many subsets, and it’s not clear
how (or if) they could be arranged in a list with a definite pattern:


𝒫(N) = { ∅, {1}, {2}, ..., {1,2}, {1,3}, ..., {39,47}, ... }.


The set 𝒫(R²) is mind boggling. Think of R² = {(x,y) : x, y ∈ R} as the set
of all points on the Cartesian plane. A subset of R² (that is, an element of
𝒫(R²)) is a set of points in the plane. Let’s look at some of these sets. Since
{(1,2),(1,1)} ⊆ R², we know that {(1,2),(1,1)} ∈ 𝒫(R²). We can even draw a
picture of this subset, as in Figure 1.4(a). For another example, the graph
of the equation y = x² is the set of points G = {(x,x²) : x ∈ R} and this is a
subset of R², so G ∈ 𝒫(R²). Figure 1.4(b) is a picture of G. Because this can
be done for any function, the graph of any imaginable function f : R → R is
an element of 𝒫(R²).


[Figure 1.4 panels: (a) the two points (1,2) and (1,1); (b) the graph of y = x²; (c) the text “INFINITE”.]


Figure 1.4. Three of the many, many sets in 𝒫(R²)


In fact, any black-and-white image on the plane can be thought of as
a subset of R², where the black points belong to the subset and the white
points do not. So the text “INFINITE” in Figure 1.4(c) is a subset of R² and
therefore an element of 𝒫(R²). By that token, 𝒫(R²) contains a copy of the
page you are reading now.


Thus, in addition to containing every imaginable function and every
imaginable black-and-white image, 𝒫(R²) also contains the full text of every
book that was ever written, those that are yet to be written and those that
will never be written. Inside of 𝒫(R²) is a detailed biography of your life,
from beginning to end, as well as the biographies of all of your unborn
descendants. It is startling that the five symbols used to write 𝒫(R²) can
express such an incomprehensibly large set.

Homework: Think about 𝒫(𝒫(R²)).


Exercises for Section 1.4 


A. Write the following sets by listing their elements between braces.

1. 𝒫({{a,b},{c}})
2. 𝒫({1,2,3,4})
3. 𝒫({{∅},5})
4. 𝒫({R,Q})
5. 𝒫(𝒫({2}))
6. 𝒫({1,2}) × 𝒫({3})
7. 𝒫({a,b}) × 𝒫({0,1})
8. 𝒫({1,2} × {3})
9. 𝒫({a,b} × {0})
10. {X ∈ 𝒫({1,2,3}) : |X| ≤ 1}
11. {X ⊆ 𝒫({1,2,3}) : |X| ≤ 1}
12. {X ∈ 𝒫({1,2,3}) : 2 ∈ X}


B. Suppose that |A| =m and |B| =n. Find the following cardinalities. 


13. |𝒫(𝒫(𝒫(A)))|
14. |𝒫(𝒫(A))|
15. |𝒫(A × B)|
16. |𝒫(A) × 𝒫(B)|
17. |{X ∈ 𝒫(A) : |X| ≤ 1}|
18. |𝒫(A × 𝒫(B))|
19. |𝒫(𝒫(𝒫(A × ∅)))|
20. |{X ⊆ 𝒫(A) : |X| ≤ 1}|




1.5 Union, Intersection, Difference 


Just as numbers are combined with operations such as addition, subtraction 
and multiplication, there are various operations that can be applied to sets. 
The Cartesian product (defined in Section 1.2) is one such operation; given 
sets A and B, we can combine them with × to get a new set A × B. Here are
three new operations called union, intersection and difference. 


Definition 1.5 Suppose A and B are sets.
The union of A and B is the set A ∪ B = {x : x ∈ A or x ∈ B}.
The intersection of A and B is the set A ∩ B = {x : x ∈ A and x ∈ B}.
The difference of A and B is the set A − B = {x : x ∈ A and x ∉ B}.


In words, the union A ∪ B is the set of all things that are in A or in B (or
in both). The intersection A ∩ B is the set of all things in both A and B. The
difference A − B is the set of all things that are in A but not in B.


Example 1.8 Suppose A = {a,b,c,d,e}, B={d,e,f} and C = {1,2,3}. 


1. A ∪ B = {a,b,c,d,e,f}
2. A ∩ B = {d,e}
3. A − B = {a,b,c}
4. B − A = {f}
5. (A − B) ∪ (B − A) = {a,b,c,f}
6. A ∪ C = {a,b,c,d,e,1,2,3}
7. A ∩ C = ∅
8. A − C = {a,b,c,d,e}
9. (A ∩ C) ∪ (A − C) = {a,b,c,d,e}
10. (A ∩ B) × B = {(d,d),(d,e),(d,f),(e,d),(e,e),(e,f)}
11. (A × C) ∩ (B × C) = {(d,1),(d,2),(d,3),(e,1),(e,2),(e,3)}


Parts 12-15 use interval notation (Section 1.1), so [2,5] = {x ∈ R : 2 ≤ x ≤ 5},
etc. Sketching these on the number line may aid your understanding.


12. [2,5] ∪ [3,6] = [2,6]
13. [2,5] ∩ [3,6] = [3,5]
14. [2,5] − [3,6] = [2,3)
15. [0,3] − [1,2] = [0,1) ∪ (2,3]


Observe that for any sets X and Y it is always true that X ∪ Y = Y ∪ X
and X ∩ Y = Y ∩ X, but in general X − Y ≠ Y − X.
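
Several parts of Example 1.8 can be reproduced with Python's built-in set type, whose operators |, & and - correspond to ∪, ∩ and −. This is an illustrative sketch; the variable cart and the use of itertools.product for the Cartesian product are our own choices.

```python
from itertools import product

A = {"a", "b", "c", "d", "e"}
B = {"d", "e", "f"}
C = {1, 2, 3}

print(sorted(A | B))                 # part 1: the union A ∪ B
print(sorted(A & B))                 # part 2: the intersection A ∩ B
print(sorted(A - B), sorted(B - A))  # parts 3 and 4: the two differences
print(sorted((A - B) | (B - A)))     # part 5
print(A & C == set())                # part 7: A ∩ C is empty

cart = set(product(A & B, B))        # part 10: (A ∩ B) × B as a set of pairs
print(len(cart))                     # 6

# Union and intersection commute; difference in general does not.
print(A | B == B | A, A & B == B & A, A - B == B - A)   # True True False
```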




Example 1.9 Let A = {(x,x²) : x ∈ R} be the graph of the equation y = x²
and let B = {(x,x+2) : x ∈ R} be the graph of the equation y = x + 2. These sets
are subsets of R². They are sketched together in Figure 1.5(a). Figure 1.5(b)
shows A ∪ B, the set of all points (x,y) that are on one (or both) of the two
graphs. Observe that A ∩ B = {(−1,1),(2,4)} consists of just two elements,
the two points where the graphs intersect, as illustrated in Figure 1.5(c).
Figure 1.5(d) shows A − B, which is the set A with “holes” where B crossed it.
In set builder notation, we could write A ∪ B = {(x,y) : x ∈ R, y = x² or y = x + 2}
and A − B = {(x,x²) : x ∈ R − {−1,2}}.

[Figure 1.5 panels: (a) the graphs A and B together; (b) A ∪ B; (c) A ∩ B; (d) A − B.]


Figure 1.5. The union, intersection and difference of sets A and B 
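
The two points of A ∩ B in Example 1.9 are exactly the solutions of x² = x + 2. As a small check, the Python sketch below solves x² − x − 2 = 0 with the quadratic formula; the variable names are our own.

```python
import math

# A point (x, y) lies on both graphs exactly when x**2 = x + 2,
# that is, when x**2 - x - 2 = 0.
a, b, c = 1, -1, -2
disc = b * b - 4 * a * c                      # discriminant, here 9
roots = sorted([(-b - math.sqrt(disc)) / (2 * a),
                (-b + math.sqrt(disc)) / (2 * a)])

points = [(x, x * x) for x in roots]          # the points of A ∩ B
print(points)                                 # [(-1.0, 1.0), (2.0, 4.0)]
print(all(y == x + 2 for x, y in points))     # True: each point is also on B
```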


Exercises for Section 1.5 
1. Suppose A = {4,3,6,7,1,9}, B = {5,6,8,4} and C = {5,8,4}. Find:
(a) A ∪ B (b) A ∩ B (c) A − B
(d) A − C (e) B − A (f) A ∩ C
(g) B ∩ C (h) B ∪ C (i) C − B

2. Suppose A = {0,2,4,6,8}, B = {1,3,5,7} and C = {2,8,4}. Find:
(a) A ∪ B (b) A ∩ B (c) A − B
(d) A − C (e) B − A (f) A ∩ C
(g) B ∩ C (h) C − A (i) C − B

3. Suppose A = {0,1} and B = {1,2}. Find:
(a) (A × B) ∩ (B × B) (b) (A × B) ∪ (B × B) (c) (A × B) − (B × B)
(d) (A ∩ B) × A (e) (A × B) ∩ B (f) 𝒫(A) ∩ 𝒫(B)
(g) 𝒫(A) − 𝒫(B) (h) 𝒫(A ∩ B) (i) 𝒫(A × B)

4. Suppose A = {b,c,d} and B = {a,b}. Find:
(a) (A × B) ∩ (B × B) (b) (A × B) ∪ (B × B) (c) (A × B) − (B × B)
(d) (A ∩ B) × A (e) (A × B) ∩ B (f) 𝒫(A) ∩ 𝒫(B)
(g) 𝒫(A) − 𝒫(B) (h) 𝒫(A ∩ B) (i) 𝒫(A) × 𝒫(B)




5. Sketch the sets X = [1,3] × [1,3] and Y = [2,4] × [2,4] on the plane R². On separate
drawings, shade in the sets X ∪ Y, X ∩ Y, X − Y and Y − X. (Hint: X and Y are
Cartesian products of intervals. You may wish to review how you drew sets like
[1,3] × [1,3] in the exercises for Section 1.2.)

6. Sketch the sets X = [−1,3] × [0,2] and Y = [0,3] × [1,4] on the plane R². On separate
drawings, shade in the sets X ∪ Y, X ∩ Y, X − Y and Y − X.

7. Sketch the sets X = {(x,y) ∈ R² : x² + y² ≤ 1} and Y = {(x,y) ∈ R² : x ≥ 0} on R². On
separate drawings, shade in the sets X ∪ Y, X ∩ Y, X − Y and Y − X.

8. Sketch the sets X = {(x,y) ∈ R² : x² + y² ≤ 1} and Y = {(x,y) ∈ R² : −1 ≤ y ≤ 0} on R².
On separate drawings, shade in the sets X ∪ Y, X ∩ Y, X − Y and Y − X.

9. Is the statement (R × Z) ∩ (Z × R) = Z × Z true or false? What about the statement
(R × Z) ∪ (Z × R) = R × R?

10. Do you think the statement (R − Z) × N = (R × N) − (Z × N) is true, or false? Justify.


1.6 Complement 


This section introduces yet another set operation, called the set complement. 
The definition requires the idea of a universal set, which we now discuss. 
When dealing with a set, we almost always regard it as a subset of 
some larger set. For example, consider the set of prime numbers P =
{2,3,5,7,11,13,...}. If asked to name some things that are not in P, we
might mention some composite numbers like 4 or 6 or 423. It probably 
would not occur to us to say that Vladimir Putin is not in P. True, Vladimir 
Putin is not in P, but he lies entirely outside of the discussion of what is a 
prime number and what is not. We have an unstated assumption that 


P ⊆ N


because N is the most natural setting in which to discuss prime numbers. 
In this context, anything not in P should still be in N. This larger set N is 
called the universal set or universe for P. 

Almost every useful set in mathematics can be regarded as having some 
natural universal set. For instance, the unit circle is the set C = {(x,y) ∈ R² :
x² + y² = 1}, and since all these points are in the plane R² it is natural to
regard R? as the universal set for C. In the absence of specifics, if A is a set, 
then its universal set is often denoted as U. We are now ready to define the 
complement operation. 


Definition 1.6 Let A be a set with a universal set U. The complement
of A, denoted Ā, is the set Ā = U − A.




Example 1.10 If P is the set of prime numbers, then
P̄ = N − P = {1,4,6,8,9,10,12,...}.


Thus P̄ is the set of composite numbers and 1.
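
Because N is infinite, a computer can only examine a finite piece of it. The following Python sketch assumes the truncated universe U = {1,2,...,20} (our choice, not part of the example) and computes the complement of the primes inside it; the helper is_prime is also our own.

```python
def is_prime(n):
    """Trial division: True exactly when n is a prime number."""
    if n < 2:
        return False
    return all(n % d != 0 for d in range(2, int(n ** 0.5) + 1))


U = set(range(1, 21))                  # assumed finite stand-in for N
P = {n for n in U if is_prime(n)}      # the primes in U
P_bar = U - P                          # complement of P relative to U

print(sorted(P_bar))   # [1, 4, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20]
```

The output begins 1, 4, 6, 8, 9, 10, 12, in agreement with the list in Example 1.10.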


Example 1.11 Let A = {(x,x²) : x ∈ R} be the graph of the equation y = x².
Figure 1.6(a) shows A in its universal set R². The complement of A is Ā =
R² − A = {(x,y) ∈ R² : y ≠ x²}, illustrated by the shaded area in Figure 1.6(b).


[Figure 1.6 panels: (a) the set A; (b) its complement Ā, shaded.]


Figure 1.6. A set and its complement 


Exercises for Section 1.6 
1. Let A = {4,3,6,7,1,9} and B = {5,6,8,4} have universal set U = {0,1,2,...,10}. Find:
(a) Ā (b) B̄ (c) A ∩ Ā
(d) A ∪ Ā (e) A − Ā (f) A − B̄
(g) Ā − B̄ (h) Ā ∩ B (i) the complement of Ā ∩ B

2. Let A = {0,2,4,6,8} and B = {1,3,5,7} have universal set U = {0,1,2,...,8}. Find:
(a) Ā (b) B̄ (c) A ∩ Ā
(d) A ∪ Ā (e) A − Ā (f) the complement of A ∪ B
(g) Ā ∩ B̄ (h) the complement of A ∩ B (i) Ā × B


3. Sketch the set X = [1,3] × [1,2] on the plane R². On separate drawings, shade in
the sets X̄ and X̄ ∩ ([0,2] × [0,3]).

4. Sketch the set X = [−1,3] × [0,2] on the plane R². On separate drawings, shade in
the sets X̄ and X̄ ∩ ([−2,4] × [−1,3]).

5. Sketch the set X = {(x,y) ∈ R² : 1 ≤ x² + y² ≤ 4} on the plane R². On a separate
drawing, shade in the set X̄.

6. Sketch the set X = {(x,y) ∈ R² : y < x} on R². Shade in the set X̄.




1.7 Venn Diagrams 


In thinking about sets, it is sometimes helpful to draw informal, schematic 
diagrams of them. In doing this we often represent a set with a circle 
(or oval), which we regard as enclosing all the elements of the set. Such 
diagrams can illustrate how sets combine using various operations. For 
example, Figures 1.7(a–c) show two sets A and B that overlap in a middle
region. The sets A ∪ B, A ∩ B and A − B are shaded. Such graphical
representations of sets are called Venn diagrams, after their inventor, British
logician John Venn, 1834-1923.


[Figure 1.7 panels (a), (b) and (c): three Venn diagrams, each showing overlapping circles A and B.]


Figure 1.7. Venn diagrams for two sets 


Though you are unlikely to draw Venn diagrams as a part of a proof of 
any theorem, you will probably find them to be useful “scratch work” devices 
that help you to understand how sets combine, and to develop strategies 
for proving certain theorems or solving certain problems. The remainder of 
this section uses Venn diagrams to explore how three sets can be combined 
using ∪ and ∩.

Let’s begin with the set A ∪ B ∪ C. Our definitions suggest this should
consist of all elements which are in one or more of the sets A, B and C.
Figure 1.8(a) shows a Venn diagram for this. Similarly, we think of A ∩ B ∩ C
as all elements common to each of A, B and C, so in Figure 1.8(b) the region
belonging to all three sets is shaded.


[Figure 1.8 panels: (a) A ∪ B ∪ C; (b) A ∩ B ∩ C.]


Figure 1.8. Venn diagrams for three sets 




We can also think of A ∩ B ∩ C as the two-step operation (A ∩ B) ∩ C. In
this expression the set A ∩ B is represented by the region common to both A
and B, and when we intersect this with C we get Figure 1.8(b). This is a
visual representation of the fact that A ∩ B ∩ C = (A ∩ B) ∩ C. Similarly, we
have A ∩ B ∩ C = A ∩ (B ∩ C). Likewise, A ∪ B ∪ C = (A ∪ B) ∪ C = A ∪ (B ∪ C).

Notice that in these examples, where the expression either contains only
the symbol ∪ or only the symbol ∩, the placement of the parentheses is
irrelevant, so we are free to drop them. It is analogous to the situations in
algebra involving expressions (a+b)+c = a+(b+c) or (a·b)·c = a·(b·c). We tend
to drop the parentheses and write simply a+b+c or a·b·c. By contrast, in
an expression like (a+b)·c the parentheses are absolutely essential because
(a+b)·c and a+(b·c) are generally not equal.

Now let’s use Venn diagrams to help us understand the expressions
(A ∪ B) ∩ C and A ∪ (B ∩ C), which use a mix of ∪ and ∩. Figure 1.9 shows
how to draw a Venn diagram for (A ∪ B) ∩ C. In the drawing on the left, the
set A ∪ B is shaded with horizontal lines, while C is shaded with vertical
lines. Thus the set (A ∪ B) ∩ C is represented by the cross-hatched region
where A ∪ B and C overlap. The superfluous shadings are omitted in the
drawing on the right showing the set (A ∪ B) ∩ C.


Figure 1.9. How to make a Venn diagram for (A ∪ B) ∩ C


Now think about A ∪ (B ∩ C). In Figure 1.10 the set A is shaded with
horizontal lines, and B ∩ C is shaded with vertical lines. The union A ∪ (B ∩ C)
is represented by the totality of all shaded regions, as shown on the right.


Figure 1.10. How to make a Venn diagram for A ∪ (B ∩ C)




Compare the diagrams for (A ∪ B) ∩ C and A ∪ (B ∩ C) in Figures 1.9 and
1.10. The fact that the diagrams are different indicates that (A ∪ B) ∩ C ≠
A ∪ (B ∩ C) in general. Thus an expression such as A ∪ B ∩ C is absolutely
meaningless because we can’t tell whether it means (A ∪ B) ∩ C or A ∪ (B ∩ C).
In summary, Venn diagrams have helped us understand the following.


Important Points:


• If an expression involving sets uses only ∪, then parentheses are optional.
• If an expression involving sets uses only ∩, then parentheses are optional.
• If an expression uses both ∪ and ∩, then parentheses are essential.


In the next section we will study types of expressions that use only ∪ or
only ∩. These expressions will not require the use of parentheses.
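
These points can be tested on a concrete example. In the Python sketch below, the three sets are our own choice: the numbers 1 through 7 are placed so that each of the seven regions of a three-set Venn diagram contains exactly one element.

```python
# One element in each region of a three-set Venn diagram (arbitrary labels).
A = {1, 2, 4, 5}
B = {2, 3, 5, 6}
C = {4, 5, 6, 7}

# Only unions, or only intersections: the grouping does not matter.
print((A | B) | C == A | (B | C))   # True
print((A & B) & C == A & (B & C))   # True

# A mix of union and intersection: the grouping matters.
left = (A | B) & C
right = A | (B & C)
print(sorted(left))                 # [4, 5, 6]
print(sorted(right))                # [1, 2, 4, 5, 6]
print(left == right)                # False
```

Python itself assigns a meaning to the unparenthesized expression A | B & C, because & binds more tightly than |. That is a convention of the programming language, not of set notation.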


Exercises for Section 1.7 

1. Draw a Venn diagram for Ā, where A is a subset of a universal set U.

2. Draw a Venn diagram for B − A.

3. Draw a Venn diagram for (A − B) ∩ C.

4. Draw a Venn diagram for (A ∪ B) − C.

5. Draw Venn diagrams for A ∪ (B ∩ C) and (A ∪ B) ∩ (A ∪ C). Based on your drawings,
do you think A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C)?

6. Draw Venn diagrams for A ∩ (B ∪ C) and (A ∩ B) ∪ (A ∩ C). Based on your drawings,
do you think A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)?

7. Suppose sets A and B are in a universal set U. Draw Venn diagrams for the
complement of A ∩ B and for Ā ∪ B̄. Based on your drawings, do you think it’s true
that the complement of A ∩ B equals Ā ∪ B̄?

8. Suppose sets A and B are in a universal set U. Draw Venn diagrams for the
complement of A ∪ B and for Ā ∩ B̄. Based on your drawings, do you think it’s true
that the complement of A ∪ B equals Ā ∩ B̄?

9. Draw a Venn diagram for (A ∩ B) − C.

10. Draw a Venn diagram for (A − B) ∪ C.


Following are Venn diagrams for expressions involving sets A, B and C. Write a
corresponding expression.


[Four Venn diagrams, each showing overlapping circles A, B and C.]




1.8 Indexed Sets 


When a mathematical problem involves lots of sets, it is often convenient to 
keep track of them by using subscripts (also called indices). Thus instead of 
denoting three sets as A, B and C, we might instead write them as A_1, A_2
and A_3. These are called indexed sets.

Although we defined union and intersection to be operations that combine
two sets, you by now have no difficulty forming unions and intersections
of three or more sets. (For instance, in the previous section we drew Venn
diagrams for the intersection and union of three sets.) But let’s take a
moment to write down careful definitions. Given sets A_1, A_2, ..., A_n, the set
A_1 ∪ A_2 ∪ A_3 ∪ ⋯ ∪ A_n consists of everything that is in at least one of the
sets A_i. Likewise A_1 ∩ A_2 ∩ A_3 ∩ ⋯ ∩ A_n consists of everything that is common
to all of the sets A_i. Here is a careful definition.


Definition 1.7 Suppose A_1, A_2, ..., A_n are sets. Then

A_1 ∪ A_2 ∪ A_3 ∪ ⋯ ∪ A_n = {x : x ∈ A_i for at least one set A_i, for 1 ≤ i ≤ n},

A_1 ∩ A_2 ∩ A_3 ∩ ⋯ ∩ A_n = {x : x ∈ A_i for every set A_i, for 1 ≤ i ≤ n}.


But if the number n of sets is large, these expressions can get messy. To
overcome this, we now develop some notation akin to sigma notation. You
already know that sigma notation is a convenient symbolism for expressing
sums of many numbers. Given numbers a_1, a_2, a_3, ..., a_n, then

Σ_{i=1}^{n} a_i = a_1 + a_2 + a_3 + ⋯ + a_n.

Even if the list of numbers is infinite, the sum

Σ_{i=1}^{∞} a_i = a_1 + a_2 + a_3 + ⋯ + a_i + ⋯

is often still meaningful. The notation we are about to introduce is very
similar to this. Given sets A_1, A_2, A_3, ..., A_n, we define

⋃_{i=1}^{n} A_i = A_1 ∪ A_2 ∪ A_3 ∪ ⋯ ∪ A_n   and   ⋂_{i=1}^{n} A_i = A_1 ∩ A_2 ∩ A_3 ∩ ⋯ ∩ A_n.


Example 1.12 Suppose A_1 = {0,2,5}, A_2 = {1,2,5} and A_3 = {2,5,7}. Then

⋃_{i=1}^{3} A_i = A_1 ∪ A_2 ∪ A_3 = {0,1,2,5,7}   and   ⋂_{i=1}^{3} A_i = A_1 ∩ A_2 ∩ A_3 = {2,5}.
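
Example 1.12 translates directly into Python, where a list can hold the indexed sets and the built-in methods union and intersection accept any number of arguments. The list name sets is our own.

```python
sets = [{0, 2, 5}, {1, 2, 5}, {2, 5, 7}]   # A_1, A_2, A_3

union = set().union(*sets)                 # everything in at least one A_i
intersection = set.intersection(*sets)     # everything common to all A_i

print(sorted(union))          # [0, 1, 2, 5, 7]
print(sorted(intersection))   # [2, 5]
```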




This notation is also used when the list of sets A_1, A_2, A_3, A_4, ... is infinite:

⋃_{i=1}^{∞} A_i = A_1 ∪ A_2 ∪ A_3 ∪ ⋯ = {x : x ∈ A_i for at least one set A_i with 1 ≤ i}.

⋂_{i=1}^{∞} A_i = A_1 ∩ A_2 ∩ A_3 ∩ ⋯ = {x : x ∈ A_i for every set A_i with 1 ≤ i}.

Example 1.13 This example involves the following infinite list of sets.
A_1 = {−1,0,1}, A_2 = {−2,0,2}, A_3 = {−3,0,3}, ..., A_i = {−i,0,i}, ...


Observe that ⋃_{i=1}^{∞} A_i = Z, and ⋂_{i=1}^{∞} A_i = {0}.
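
An infinite union cannot be computed directly, but a finite truncation already shows the pattern of Example 1.13. The sketch below assumes a cutoff n_max = 50 (an arbitrary choice of ours) and uses only the first n_max sets.

```python
def A(i):
    """The set A_i = {-i, 0, i} of Example 1.13."""
    return {-i, 0, i}


n_max = 50   # assumed cutoff: a finite stand-in for the infinite list

union = set().union(*(A(i) for i in range(1, n_max + 1)))
intersection = set.intersection(*(A(i) for i in range(1, n_max + 1)))

print(union == set(range(-n_max, n_max + 1)))   # True: all integers from -50 to 50
print(intersection)                             # {0}
```

As the cutoff grows, the truncated union fills out more and more of Z, while the intersection stays {0}.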


Here is a useful twist on our new notation. We can write

⋃_{i=1}^{3} A_i = ⋃_{i∈{1,2,3}} A_i,

which is understood to be the union of the sets A_i for i = 1,2,3. Likewise:

⋂_{i=1}^{3} A_i = ⋂_{i∈{1,2,3}} A_i

⋃_{i=1}^{∞} A_i = ⋃_{i∈N} A_i

⋂_{i=1}^{∞} A_i = ⋂_{i∈N} A_i

Here we are taking the union or intersection of a collection of sets A_i where
i is an element of some set, be it {1,2,3} or N. In general, the way this works
is that we will have a collection of sets A_i for i ∈ I, where I is the set of
possible subscripts. The set I is called an index set.

It is important to realize that the set I need not even consist of integers.
(We could subscript with letters or real numbers, etc.) Since we are
programmed to think of i as an integer, let’s make a slight notational change:
Use α, not i, to stand for an element of I. Thus we are dealing with a
collection of sets A_α for α ∈ I. This leads to the following definition.


Definition 1.8 If A_α is a set for every α in some index set I ≠ ∅, then

⋃_{α∈I} A_α = {x : x ∈ A_α for at least one set A_α with α ∈ I}

⋂_{α∈I} A_α = {x : x ∈ A_α for every set A_α with α ∈ I}.




Example 1.14 In this example, all sets A_α are subsets of the plane R².
Each α belongs to the index set I = [0,2] = {x ∈ R : 0 ≤ x ≤ 2}, which is the set
of all real numbers between 0 and 2. For each number α ∈ I, define A_α to be
the set A_α = [α,2] × [0,α], which is the rectangle on the xy-plane whose base
runs from α to 2 on the x-axis, and whose height is α. Some of these are
shown shaded below. (The dotted diagonal line y = x is not a part of any of
the sets, but is shown for clarity, as the upper left corner of each A_α touches
it.) Note that these sets are not indexed with just integers. For example, as
√2 ∈ I, there is a set A_√2, which is shown below on the right.


[Figure: several of the rectangles A_α, shaded, with the dotted line y = x; the set A_√2 is on the right.]


Note that A_0 = [0,2] × [0,0] = [0,2] × {0} is the interval [0,2] on the x-axis
(a “flat” rectangle). Also, A_2 = [2,2] × [0,2] = {2} × [0,2] is the vertical side of
the dotted triangle in the above pictures.

Now consider the infinite union ⋃_{α∈I} A_α. It is the shaded triangle shown
below, because any point (x,y) on this triangle belongs to the set A_x, and is
therefore in the union. (And any point not on the triangle is not in any A_α.)


[Figure: the shaded triangle with vertices (0,0), (2,0) and (2,2).]


Now let’s work out the intersection ⋂_{α∈I} A_α. Notice that the point (2,0) on
the x-axis is the lower right corner of any set A_α, so (2,0) ∈ A_α for any α ∈ I.
Therefore the point (2,0) is in the intersection of all the A_α. But any other
point (x,y) ≠ (2,0) on the triangle does not belong to all of the sets A_α. The
reason is that if x < 2, then (x,y) ∉ A_α for any x < α ≤ 2. (Check this.) And if
x = 2, then (x,y) ∉ A_α for any 0 ≤ α < y. Consequently

⋂_{α∈I} A_α = {(2,0)}.

This intersection consists of only one element, the point (2,0).
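
The claims of Example 1.14 can be spot-checked on a grid of sample points. The Python sketch below is only a finite sanity check, not a proof: it assumes sample values 0, 1/4, 1/2, ..., 2 for both the index α and the coordinates, and uses exact fractions to avoid rounding. The function name in_A is our own.

```python
from fractions import Fraction


def in_A(alpha, x, y):
    """True if (x, y) lies in the rectangle A_alpha = [alpha, 2] x [0, alpha]."""
    return alpha <= x <= 2 and 0 <= y <= alpha


grid = [Fraction(k, 4) for k in range(0, 9)]   # sample values 0, 1/4, ..., 2

# Every sampled point of the triangle (0 <= y <= x <= 2) lies in A_x.
print(all(in_A(x, x, y) for x in grid for y in grid if y <= x))       # True

# No sampled point above the line y = x lies in any sampled A_alpha.
print(any(in_A(al, x, y)
          for al in grid for x in grid for y in grid if y > x))       # False

# Sampled points that lie in every sampled A_alpha.
common = [(x, y) for x in grid for y in grid
          if all(in_A(al, x, y) for al in grid)]
print(common == [(2, 0)])                                             # True
```

The last check succeeds because A_0 forces y = 0 and A_2 forces x = 2, which is the same reasoning used in the example.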




Example 1.15 Here our sets are indexed by R². For any (a,b) ∈ R², let P_(a,b)
be the following subset of R³:

P_(a,b) = {(x,y,z) ∈ R³ : ax + by = 0}.

In words, given a point (a,b) ∈ R², the corresponding set P_(a,b) consists of
all points (x,y,z) in R³ that satisfy the equation ax + by = 0. From previous
math courses you will recognize this as a plane in R³, that is, P_(a,b) is a plane
in R³. Moreover, since any point (0,0,z) on the z-axis automatically satisfies
ax + by = 0, each P_(a,b) contains the z-axis.

Figure 1.11 (left) shows the set P_(1,2) = {(x,y,z) ∈ R³ : x + 2y = 0}. It is the
vertical plane that intersects the xy-plane at the line x + 2y = 0.


Figure 1.11. The sets P_(a,b) are vertical planes containing the z-axis.


For any point (a,b) ∈ R² with (a,b) ≠ (0,0), we can visualize P_(a,b) as the
vertical plane that cuts the xy-plane at the line ax + by = 0. Figure 1.11
(right) shows a few of the P_(a,b). Since any two such planes intersect along
the z-axis, and because the z-axis is a subset of every P_(a,b), it is immediately
clear that

⋂_{(a,b)∈R²} P_(a,b) = {(0,0,z) : z ∈ R} = “the z-axis”.

For the union, note that any given point (a,b,c) ∈ R³ belongs to the set
P_(−b,a) because (x,y,z) = (a,b,c) satisfies the equation −bx + ay = 0. (In fact,
any (a,b,c) belongs to the special set P_(0,0) = R³, which is the only P_(a,b) that
is not a plane.) Since any point in R³ belongs to some P_(a,b) we have

⋃_{(a,b)∈R²} P_(a,b) = R³.
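
The two membership facts used in Example 1.15 are easy to test numerically. The following Python sketch checks them on sample points with integer coordinates; the function name in_P, the random sample and the small ranges are our own assumptions, chosen so that the arithmetic is exact.

```python
import random


def in_P(a, b, point):
    """True if point = (x, y, z) satisfies ax + by = 0, i.e. lies in P_(a,b)."""
    x, y, z = point
    return a * x + b * y == 0


random.seed(0)   # fixed seed so the sample is reproducible
points = [tuple(random.randint(-9, 9) for _ in range(3)) for _ in range(100)]

# Each sample point (x, y, z) lies in the set indexed by (-y, x).
print(all(in_P(-y, x, (x, y, z)) for (x, y, z) in points))        # True

# Each sampled point (0, 0, z) of the z-axis lies in every sampled P_(a,b).
print(all(in_P(a, b, (0, 0, z))
          for a in range(-3, 4) for b in range(-3, 4)
          for z in range(-3, 4)))                                  # True
```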




Exercises for Section 1.8 


1. Suppose A_1 = {a,b,d,e,g,f}, A_2 = {a,b,c,d}, A_3 = {b,d,a} and A_4 = {a,b,h}.
(a) ⋃_{i=1}^{4} A_i =    (b) ⋂_{i=1}^{4} A_i =

2. Suppose A_1 = {0,2,4,8,10,12,14,16,18,20,22,24},
A_2 = {0,3,6,9,12,15,18,21,24},
A_3 = {0,4,8,12,16,20,24}.
(a) ⋃_{i=1}^{3} A_i =    (b) ⋂_{i=1}^{3} A_i =

3. For each n ∈ N, let A_n = {0,1,2,3,...,n}.
(a) ⋃_{i∈N} A_i =    (b) ⋂_{i∈N} A_i =

4. For each n ∈ N, let A_n = {−2n,0,2n}.
(a) ⋃_{i∈N} A_i =    (b) ⋂_{i∈N} A_i =

5. (a) ⋃_{i∈N} [i,i+1] =    (b) ⋂_{i∈N} [i,i+1] =

6. (a) ⋃_{i∈N} [0,i+1] =    (b) ⋂_{i∈N} [0,i+1] =

7. (a) ⋃_{i∈N} R × [i,i+1] =    (b) ⋂_{i∈N} R × [i,i+1] =

8. (a) ⋃_{α∈R} {α} × [0,1] =    (b) ⋂_{α∈R} {α} × [0,1] =

9. (a) ⋃_{X∈𝒫(N)} X =    (b) ⋂_{X∈𝒫(N)} X =

10. (a) ⋃_{x∈[0,1]} [x,1] × [0,x²] =    (b) ⋂_{x∈[0,1]} [x,1] × [0,x²] =

11. Is ⋂_{α∈I} A_α ⊆ ⋃_{α∈I} A_α always true for any collection of sets A_α with index set I?

12. If ⋂_{α∈I} A_α = ⋃_{α∈I} A_α, what do you think can be said about the relationships between
the sets A_α?

13. If J ≠ ∅ and J ⊆ I, does it follow that ⋃_{α∈J} A_α ⊆ ⋃_{α∈I} A_α? What about ⋂_{α∈J} A_α ⊆ ⋂_{α∈I} A_α?

14. If J ≠ ∅ and J ⊆ I, does it follow that ⋂_{α∈I} A_α ⊆ ⋂_{α∈J} A_α? Explain.




1.9 Sets That Are Number Systems 


In practice, the sets we tend to be most interested in often have special 
properties and structures. For example, the sets Z, Q and R are familiar 
number systems: Given such a set, any two of its elements can be added 
(or multiplied, etc.) together to produce another element in the set. These 
operations obey the familiar commutative, associative and distributive 
properties that we all have dealt with for years. Such properties lead to 
the standard algebraic techniques for solving equations. Even though we 
are concerned with the idea of proof, we will not find it necessary to define, 
prove or verify such properties and techniques; we will accept them as the 
ground rules upon which our further deductions are based. 

We also accept as fact the natural ordering of the elements of N, Z, Q and
R, so that (for example) the meaning of “5 < 7” is understood and does not
need to be justified or explained. Similarly, if x < y and a ≠ 0, we know that
ax < ay or ax > ay, depending on whether a is positive or negative.

Another thing that our ingrained understanding of number order tells
us is that any non-empty subset of N has a smallest element. In other words,
if A ⊆ N and A ≠ ∅, then there is an element x_0 ∈ A that is smaller than
every other element of A. (To find it, start at 1, then move in increments to
2, 3, 4, etc., until you hit a number x_0 ∈ A; this is the smallest element of
A.) Similarly, given b ∈ Z, any non-empty subset A ⊆ {b, b+1, b+2, b+3, ...}
has a smallest element. This fact is sometimes called the well-ordering
principle. There is no need to remember this term, but do be aware that
we will use this simple, intuitive idea often in proofs, usually without a
second thought.

The well-ordering principle seems innocent enough, but it actually says
something very fundamental and special about the positive integers N.
In fact, the corresponding statement for the positive real numbers is false:
The subset A = {1/n : n ∈ N} of the positive reals has no smallest element
because for any x_0 = 1/n ∈ A we might pick, there is a smaller element 1/(n+1) ∈ A.

One consequence of the well-ordering principle (as we will see below) is 
the familiar fact that any integer a can be divided by a non-zero integer b, 
resulting in a quotient q and remainder r. For example, b = 3 goes into 
a = 17 q = 5 times with remainder r = 2. In symbols, 17 = 5·3 + 2, or a = qb + r. 
This significant fact is called the division algorithm. 


Fact 1.5 (Division Algorithm) Given integers a and b with b > 0, 
there exist unique integers q and r for which a = qb + r and 0 ≤ r < b. 




Although there is no harm in accepting the division algorithm without 
proof, note that it does follow from the well-ordering principle. Here’s how: 
Given integers a, b with b > 0, form the set 


A = {a − xb : x ∈ Z, 0 ≤ a − xb} ⊆ {0, 1, 2, 3, ...}. 


This is the set of non-negative integers obtained by subtracting multiples of b 
from a. (Example: If a = 17 and b = 3, then we get A = {2, 5, 8, 11, 14, 17, 20, ...} 
by subtracting multiples of 3 from 17. Note that the remainder r = 2 of 17 ÷ 3 
is the smallest element of A.) In general, by the well-ordering principle, the 
set A = {a − xb : x ∈ Z, 0 ≤ a − xb} has a smallest element r. Then r = a − qb 
for some x = q ∈ Z, so a = qb + r. Because r ∈ A ⊆ {0, 1, 2, 3, ...}, we know 0 ≤ r. 
In addition, it cannot happen that r ≥ b: If this were the case, then the 
non-negative number r − b = (a − qb) − b = a − (q+1)b having form a − xb would 
be a smaller element of A than r, and r was explicitly chosen as the smallest 
element of A. Since it is not true that r ≥ b, it must be that r < b. Therefore 
0 ≤ r < b. We’ve now produced a q and an r for which a = qb + r and 0 ≤ r < b. 
(Exercise 28 of Chapter 7 asks you to prove q and r are unique in the sense 
that no other values of q and r have these properties.) 
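The argument above can be traced in code. The following Python sketch is an illustration under stated assumptions, not an efficient algorithm: the function name divide is hypothetical, and the starting value x = −|a| is simply one convenient choice showing that A is non-empty. It walks through the elements a − xb of A until it reaches the smallest one, which is the remainder r.

```python
def divide(a, b):
    """Return (q, r) with a == q*b + r and 0 <= r < b, for integers a, b with b > 0."""
    if b <= 0:
        raise ValueError("b must be positive")
    # Start from an x for which a - x*b is in A:
    # with x = -|a| we get a - x*b = a + |a|*b >= a + |a| >= 0.
    x = -abs(a)
    # Increasing x by 1 subtracts b. Continue while the result stays non-negative.
    while a - (x + 1) * b >= 0:
        x += 1
    q, r = x, a - x * b   # r is now the smallest element of A
    return q, r

print(divide(17, 3))    # (5, 2)
print(divide(-17, 3))   # (-6, 1)
```

For b > 0 the result agrees with Python's built-in divmod(a, b), which also returns a remainder satisfying 0 ≤ r < b.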


Moving on, it is time to clarify a small issue. This chapter asserted that 
all of mathematics can be described with sets. But at the same time we 
maintained that some mathematical entities are not sets. (For instance, 
our approach was to say that an individual number, such as 5, is not itself 
a set, though it may be an element of a set.) We have made this distinction 
because we need a place to stand as we explore sets: After all, it would 
appear suspiciously circular to declare that every mathematical entity is a 
set, and then go on to define a set as a collection whose members are sets! 

But to most mathematicians, saying “The number 5 is not a set,” is like 
saying “The number 5 is not a number.” 

The truth is that any number can itself be understood as a set. One 
way to do this is to begin with the identification 0 = ∅. Then 1 = {∅} = 
{0}, and 2 = {∅, {∅}} = {0, 1}, and 3 = {∅, {∅}, {∅, {∅}}} = {0, 1, 2}. In general the 
natural number n is the set n = {0, 1, 2, ..., n − 1} of the n numbers (which are 
themselves sets) that come before it. 
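This construction can be imitated with Python's frozenset type, which allows sets to be elements of other sets. The sketch below is only an illustration. The function name von_neumann is hypothetical, and the step n + 1 = n ∪ {n} is an assumption introduced here because it produces exactly the sets {0, 1, ..., n − 1} described above.

```python
def von_neumann(n):
    """Return the natural number n encoded as a frozenset, starting from 0 = empty set."""
    current = frozenset()               # 0 is the empty set
    for _ in range(n):
        current = current | {current}   # n + 1 = n ∪ {n}
    return current

zero, one, two, three = (von_neumann(k) for k in range(4))
print(one == frozenset({zero}))               # True: 1 = {0}
print(three == frozenset({zero, one, two}))   # True: 3 = {0, 1, 2}
print(len(von_neumann(5)))                    # 5
```

In this encoding the set representing n has exactly n elements.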

We will not undertake such a study here, but the elements of the number 
systems Z, Q and R can all be defined in terms of sets. (Even the operations 
of addition, multiplication, etc., can be defined in set-theoretic terms.) In 
fact, mathematics itself can be regarded as the study of things that can 
be described as sets. Any mathematical entity is a set, whether or not we 
choose to think of it that way. 




1.10 Russell’s Paradox 


This section contains some background information that may be interesting, 
but is not used in the remainder of the book. 

The philosopher and mathematician Bertrand Russell (1872-1970) did 
groundbreaking work on the theory of sets and the foundations of math- 
ematics. He was probably among the first to understand how the misuse 
of sets can lead to bizarre and paradoxical situations. He is famous for an 
idea that has come to be known as Russell’s paradox. 

Russell’s paradox involves the following set of sets: 


A = {X : X is a set and X ∉ X}.     (1.1) 


In words, A is the set of all sets that do not include themselves as elements. 
Most sets we can think of are in A. The set Z of integers is not an integer 
(i.e., Z ∉ Z) and therefore Z ∈ A. Also ∅ ∈ A because ∅ is a set and ∅ ∉ ∅. 

Is there a set that is not in A? Consider B= {{{{...}}}}. Think of B as a 
box containing a box, containing a box, containing a box, and so on, forever. 
Or a set of identical Russian dolls, nested one inside the other, endlessly. 
The curious thing about B is that it has just one element, namely B itself: 


B = { {{{...}}} },  where the inner set {{{...}}} is B itself. 


Thus B ∈ B. As B does not satisfy B ∉ B, Equation (1.1) says B ∉ A. 

Russell’s paradox arises from the question “Is A an element of A?” 

For a set X, Equation (1.1) says X ∈ A means the same thing as X ∉ X. 
So for X = A, the previous line says A ∈ A means the same thing as A ∉ A. 
Conclusions: If A ∈ A is true, then it is false. If A ∈ A is false, then it is true. 
This is Russell’s paradox. 

Initially Russell’s paradox sparked a crisis among mathematicians. How 
could a mathematical statement be both true and false? This seemed to be 
in opposition to the very essence of mathematics. 

The paradox instigated a very careful examination of set theory and 
an evaluation of what can and cannot be regarded as a set. Eventually 
mathematicians settled upon a collection of axioms for set theory—the 
so-called Zermelo-Fraenkel axioms. One of these axioms is the well- 
ordering principle of the previous section. Another, the axiom of foundation, 
states that no non-empty set X is allowed to have the property X ∩ x ≠ ∅ for 
all its elements x. This rules out such circularly defined “sets” as B = {B} 
mentioned above. If we adhere to these axioms, then situations like Russell’s 
paradox disappear. Most mathematicians accept all this on faith and happily 
ignore the Zermelo-Fraenkel axioms. Paradoxes like Russell’s do not tend 
to come up in everyday mathematics—you have to go out of your way to 
construct them. 

Still, Russell’s paradox reminds us that precision of thought and lan- 
guage is an important part of doing mathematics. The next chapter deals 
with the topic of logic, a codification of thought and language. 


Additional Reading on Sets. For a lively account of Bertrand Russell’s 
life and work (including his paradox), see the graphic novel Logicomix: An 
Epic Search For Truth, by Apostolos Doxiadis and Christos Papadimitriou. 
Also see cartoonist Jessica Hagy’s online strip Indexed—it is based largely 
on Venn diagrams. 




CHAPTER 2 


Logic 


Logic is a systematic way of thinking that allows us to parse the meanings 
of sentences and to deduce new information from old information. You 
use logic informally in everyday life and certainly also in doing mathematics. 

For example, say you are working with a certain circle (call it “Circle X”), 
and suppose you have available the following two pieces of information. 


1. Circle X has a radius of 3 units. 
2. If any circle has radius r, then its area is πr² square units. 


You have no trouble putting these two facts together to get: 
3. Circle X has area 9π square units. 


In doing this you are using logic to combine existing information to 
produce new information. Because deducing new information is central to 
mathematics, logic plays a fundamental role. This chapter is intended to 
give you a sufficient mastery of it. 

It is important to realize that logic is a process of deducing information 
correctly, not just deducing correct information. For example, suppose we 
were mistaken and Circle X actually had a radius of 4, not 3. Let’s look at 
our exact same argument again. 


1. Circle X has a radius of 3 units. 
2. If any circle has radius r, then its area is πr² square units. 


3. Circle X has area 9π square units. 


The sentence “Circle X has a radius of 3 units.” is now untrue, and so is our 
conclusion “Circle X has area 9π square units.” But the logic is perfectly 
correct; the information was combined correctly, even if some of it was false. 
This distinction between correct logic and correct information is significant 
because it is often important to follow the consequences of an incorrect 
assumption. Ideally, we want both our logic and our information to be 
correct, but the point is that they are different things. 




In proving theorems, we apply logic to information that is considered 
obviously true (such as “Any two points determine exactly one line.”) or is 
already known to be true (e.g., the Pythagorean theorem). If our logic is 
correct, then anything we deduce from such information will also be true 
(or at least as true as the “obviously true” information we began with). 


2.1 Statements 


The study of logic begins with statements. A statement is a sentence or 
a mathematical expression that is either definitely true or definitely false. 
You can think of statements as pieces of information that are either correct 
or incorrect. Thus statements are pieces of information that we might 
apply logic to in order to produce other pieces of information (which are also 
statements). 


Example 2.1 Here are some examples of statements. They are all true. 
If a circle has radius r, then its area is πr² square units. 
Every even number is divisible by 2. 
2 ∈ Z 
√2 ∉ Z 
N ⊆ Z 
The set {0, 1, 2} has three elements. 
Some right triangles are isosceles. 


Example 2.2. Here are some additional statements. They are all false. 
All right triangles are isosceles. 
5 = 2 
√2 ∉ R 
Z ⊆ N 
{0, 1, 2} ∩ N = ∅ 


Example 2.3. Here non-statements are paired with similar statements. 


NOT a statement:                  | Statement: 
Add 5 to both sides.              | Adding 5 to both sides of x − 5 = 37 gives x = 42. 
Z                                 | 42 ∈ Z 
42                                | 42 is not a number. 
What is the solution of 2x = 84?  | The solution of 2x = 84 is 42. 




Example 2.4 We will often use the letters P, Q, R and S to stand for 
specific statements. When more letters are needed we can use subscripts. 
Here are more statements, designated with letters. You decide which of 
them are true and which are false. 


P: For every integer n > 1, the number 2ⁿ − 1 is prime. 
Q: Every polynomial of degree n has at most n roots. 
R: The function f(x) = x² is continuous. 
S1: Z ⊆ ∅ 
S2: {0, −1, −2} ∩ N = ∅ 


Designating statements with letters (as was done above) is a very useful 
shorthand. In discussing a particular statement, such as “The function 
f(x) = x² is continuous,” it is convenient to just refer to it as R to avoid having 
to write or say it many times. 

Statements can contain variables. Here is an example. 


P: If an integer x is a multiple of 6, then x is even. 


This is a sentence that is true. (All multiples of 6 are even, so no matter 
which multiple of 6 the integer x happens to be, it is even.) Since the sentence 
P is definitely true, it is a statement. When a sentence or statement P 
contains a variable such as x, we sometimes denote it as P(x) to indicate that 
it is saying something about x. Thus the above statement can be denoted as 


P(x): If an integer x is a multiple of 6, then x is even. 


A statement or sentence involving two variables might be denoted P(x, y), 
and so on. 

It is quite possible for a sentence containing variables to not be a state- 
ment. Consider the following example. 


Q(x): The integer x is even. 


Is this a statement? Whether it is true or false depends on just which integer 
x is. It is true if x =4 and false if x = 7, etc. But without any stipulations on 
the value of x it is impossible to say whether Q(x) is true or false. Since it 
is neither definitely true nor definitely false, Q(x) cannot be a statement. 
A sentence such as this, whose truth depends on the value of one or more 
variables, is called an open sentence. The variables in an open sentence 
(or statement) can represent any type of entity, not just numbers. Here is 
an open sentence where the variables are functions: 




R(f,g): The function f is the derivative of the function g. 


This open sentence is true if f(x) = 2x and g(x) = x². It is false if f(x) = x³ 
and g(x) = x², etc. A sentence such as R(f,g) (that involves variables) can be 
denoted either as R(f,g) or just R. We use the expression R(f,g) when we 
want to emphasize that the sentence involves variables. 
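One informal way to picture the difference between the statement P(x) and the open sentence Q(x) discussed above is to model each as a Python function returning True or False. This is a sketch for illustration only. The function names follow the letters used in the text, and checking a finite range of integers is merely evidence, not a proof.

```python
def Q(x):
    """Open sentence Q(x): the integer x is even."""
    return x % 2 == 0

def P(x):
    """P(x): if the integer x is a multiple of 6, then x is even.

    This fails only for a multiple of 6 that is not even.
    """
    return not (x % 6 == 0 and x % 2 != 0)

print(Q(4), Q(7))                              # True False
print(all(P(x) for x in range(-1000, 1001)))   # True
```

The value of Q(x) changes with x, which is why Q(x) is an open sentence. P(x) comes out true for every integer tried, consistent with P being a true statement.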

We will have more to say about open sentences later, but for now let’s 
return to statements. 

Statements are everywhere in mathematics. Any result or theorem 
that has been proved true is a statement. The quadratic formula and the 
Pythagorean theorem are both statements: 


P: The solutions of the equation ax² + bx + c = 0 are x = (−b ± √(b² − 4ac)) / (2a). 


Q: If a right triangle has legs of lengths a and b and hypotenuse of 
length c, then a² + b² = c². 


Here is a very famous statement, so famous, in fact, that it has a name. 
It is called Fermat’s last theorem after Pierre Fermat, a seventeenth- 
century French mathematician who scribbled it in the margin of a book. 


R: For all numbers a, b, c, n ∈ N with n > 2, it is the case that aⁿ + bⁿ ≠ cⁿ. 


Fermat believed this statement to be true. He noted that he could prove it 
was true, except the book’s margin was too narrow to contain his proof. It 
is doubtful that he really had a correct proof in mind, for after his death 
many generations of brilliant mathematicians tried unsuccessfully to prove 
that his statement was true (or false). Finally, in 1993, Andrew Wiles of 
Princeton University announced that he had devised a proof. Wiles had 
worked on the problem for over seven years, and his proof runs through 
hundreds of pages. The moral of this story is that some true statements 
are not obviously true. 

Here is another statement famous enough to be named. It was first 
posed in the eighteenth century by the German mathematician Christian 
Goldbach, and thus is called the Goldbach conjecture: 


S: Every even integer greater than 2 is a sum of two prime numbers. 


You must agree that S is either true or false. It appears to be true, because 
when you examine even numbers that are bigger than 2, they seem to be 
sums of two primes: 4=2+2, 6=3+3, 8=3+5, 10=5+5, 12=5+7, 
100 = 17 + 83 and so on. But that’s not to say there isn’t some large even 
number that’s not the sum of two primes. If such a number exists, then S 
is false. The thing is, in the over 260 years since Goldbach first posed this 
problem, no one has been able to determine whether it’s true or false. But 
since it is clearly either true or false, S is a statement. 
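The kind of examination described above is easy to automate for small cases. The following Python sketch is an illustration only. The helper names is_prime and goldbach_pair and the bound 2000 are arbitrary choices made here, and a finite search like this is evidence for S, never a proof of it.

```python
def is_prime(n):
    """Return True if n is a prime number (trial division)."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True

def goldbach_pair(n):
    """Return primes (p, q) with p + q == n and p <= q, or None if there are none."""
    for p in range(2, n // 2 + 1):
        if is_prime(p) and is_prime(n - p):
            return p, n - p
    return None

print(goldbach_pair(100))   # (3, 97)
# Check every even integer from 4 to 2000.
print(all(goldbach_pair(n) is not None for n in range(4, 2001, 2)))   # True
```

A single even number for which goldbach_pair returned None would show that S is false. Finding none in a finite range settles nothing.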

This book is about the methods that can be used to prove that S (or any 
other statement) is true or false. To prove that S is true, we start with obvious 
statements (or other statements that have been proven true) and use logic 
to deduce more and more complex statements until finally we obtain the 
statement S. Of course some statements are more difficult to prove than 
others, and S appears to be notoriously difficult; we will concentrate on 
statements that are easier to prove. 

But the point is this: In proving that statements are true, we use logic 
to help us understand statements and to combine pieces of information 
to produce new pieces of information. In the next several sections we 
explore some standard ways that statements can be combined to form new 
statements, or broken down into simpler statements. 


Exercises for Section 2.1 


Decide whether or not the following are statements. In the case of a statement, 
say if it is true or false, if possible. 


1. Every real number is an even integer. 
2. Every even integer is a real number. 
3. If x and y are real numbers and 5x = 5y, then x = y. 
4. Sets Z and N. 
5. Sets Z and N are infinite. 
6. Some sets are finite. 
7. The derivative of any polynomial of degree 5 is a polynomial of degree 6. 
8. N ∉ 𝒫(N). 
9. cos(x) = −1. 
10. (R × N) ∩ (N × R) = N × N. 
11. The integer x is a multiple of 7. 
12. If the integer x is a multiple of 7, then it is divisible by 7. 
13. Either x is a multiple of 7, or it is not. 
14. Call me Ishmael. 
15. In the beginning, God created the heaven and the earth. 




2.2 And, Or, Not 


The word “and” can be used to combine two statements to form a new 
statement. Consider for example the following sentence. 


R1: The number 2 is even and the number 3 is odd. 


We recognize this as a true statement, based on our ingrained understanding 
of the meaning of the word “and.” Notice that R1 is made up of two simpler 
statements: 


P: The number 2 is even. 
Q: The number 3 is odd. 


These are joined together by the word “and” to form the more complex 
statement R1. The statement R1 asserts that P and Q are both true. Since 
both P and Q are in fact true, the statement R1 is also true. 

Had one or both of P and Q been false, then R1 would be false. For 
instance, each of the following statements is false. 


R2: The number 1 is even and the number 3 is odd. 
R3: The number 2 is even and the number 4 is odd. 
R4: The number 3 is even and the number 2 is odd. 


From these examples we see that any two statements P and Q can be 
combined to form a new statement “P and Q.” In the spirit of using letters 
to denote statements, we now introduce the special symbol ∧ to stand for the 
word “and.” Thus if P and Q are statements, P ∧ Q stands for the statement 
“P and Q.” The statement P ∧ Q is true if both P and Q are true; otherwise 
it is false. This is summarized in the following table, called a truth table. 


P | Q | P ∧ Q 
T | T |   T 
T | F |   F 
F | T |   F 
F | F |   F 


In this table, T stands for “True,” and F stands for “False.” (T and F are 
called truth values.) Each line lists one of the four possible combinations 
of truth values for P and Q, and the column headed by P ∧ Q tells whether 
the statement P ∧ Q is true or false in each case. 
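A truth table like the one above can also be generated mechanically. In the Python sketch below, the helper name truth_table is hypothetical, and Python's built-in and operator on the values True and False is assumed to play the role of ∧.

```python
from itertools import product

def truth_table(connective):
    """List (P, Q, connective(P, Q)) for the four combinations of truth values."""
    return [(p, q, connective(p, q))
            for p, q in product([True, False], repeat=2)]

# Rows appear in the same order as in the table: TT, TF, FT, FF.
for p, q, value in truth_table(lambda p, q: p and q):
    print(p, q, value)
```

Only the first row, where both inputs are True, produces True in the last column.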




Statements can also be combined using the word “or.” Consider the 
following four statements. 


S1: The number 2 is even or the number 3 is odd. 
S2: The number 1 is even or the number 3 is odd. 
S3: The number 2 is even or the number 4 is odd. 
S4: The number 3 is even or the number 2 is odd. 


In mathematics, the assertion “P or Q” is always understood to mean that 
one or both of P and Q is true. Thus statements S1, S2, S3 are all true, 
while S4 is false. The symbol ∨ is used to stand for the word “or.” So if P 
and Q are statements, P ∨ Q represents the statement “P or Q.” Here is the 
truth table. 


P | Q | P ∨ Q 
T | T |   T 
T | F |   T 
F | T |   T 
F | F |   F 


It is important to be aware that the meaning of “or” expressed in the 
above table differs from the way it is often used in everyday conversation. 
For example, suppose a university official makes the following threat: 


You pay your tuition or you will be withdrawn from school. 


You understand that this means that either you pay your tuition or you will 
be withdrawn from school, but not both. In mathematics we never use the 
word “or” in such a sense. For us “or” means exactly what is stated in the 
table for ∨. Thus P ∨ Q being true means one or both of P and Q is true. If 
we ever need to express the fact that exactly one of P and Q is true, we use 
one of the following constructions: 


P or Q, but not both. 
Either P or Q. 
Exactly one of P or Q. 


If the university official were a mathematician, he might have qualified his 
statement in one of the following ways. 


Pay your tuition or you will be withdrawn from school, but not both. 


Either you pay your tuition or you will be withdrawn from school. 
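The two meanings of “or” discussed above can be compared side by side. In this Python sketch the function names inclusive_or and exactly_one are hypothetical. The first models the mathematical ∨, and the second models “P or Q, but not both.”

```python
from itertools import product

def inclusive_or(p, q):
    """The mathematical 'or': true when one or both of p, q are true."""
    return p or q

def exactly_one(p, q):
    """'P or Q, but not both': true when exactly one of p, q is true."""
    return p != q

for p, q in product([True, False], repeat=2):
    print(p, q, inclusive_or(p, q), exactly_one(p, q))
```

The two columns agree except in the first row, where P and Q are both true.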




To conclude this section, we mention another way of obtaining new 
statements from old ones. Given any statement P, we can form the new 
statement “It is not true that P.” For example, consider the following 
statement. 


The number 2 is even. 


This statement is true. Now change it by inserting the words “It is not true 
that” at the beginning: 


It is not true that the number 2 is even. 


This new statement is false. 


For another example, starting with the false statement “2 ∈ ∅,” we get 
the true statement “It is not true that 2 ∈ ∅.” 


We use the symbol ~ to stand for the words “It’s not true that,” so ~P 
means “It’s not true that P.” We can read ~P simply as “not P.” Unlike 
∧ and ∨, which combine two statements, the symbol ~ just alters a single 
statement. Thus its truth table has just two lines, one for each possible 
value of P. 


P | ~P 
T |  F 
F |  T 


The statement ~ P is called the negation of P. The negation of a specific 
statement can be expressed in numerous ways. Consider 


P: The number 2 is even. 

Here are several ways of expressing its negation. 


~P: It’s not true that the number 2 is even. 
~P: It is false that the number 2 is even. 
~P: The number 2 is not even. 


In this section we’ve learned how to combine or modify statements with 
the operations ∧, ∨ and ~. Of course we can also apply these operations 
to open sentences or a mixture of open sentences and statements. For 
example, (x is an even integer) ∧ (3 is an odd integer) is an open sentence 
that is a combination of an open sentence and a statement. 




Exercises for Section 2.2 

Express each statement or open sentence in a symbolic form such as P ∧ Q, P ∨ Q, 
P ∨ ~Q or ~P, etc. Be sure to also state exactly what statements P and Q stand for. 

1. The number 8 is both even and a power of 2. 
2. The matrix A is not invertible. 
3. x ≠ y     4. x < y     5. y ≥ x 
6. There is a quiz scheduled for Wednesday or Friday. 
7. The number x equals zero, but the number y does not. 
8. At least one of the numbers x and y equals 0. 
9. x ∈ A − B     10. x ∈ A ∪ B     11. A ∈ {X ∈ 𝒫(N) : |X| < ∞} 
12. Happy families are all alike, but each unhappy family is unhappy in its own way. 
(Leo Tolstoy, Anna Karenina) 
13. Human beings want to be good, but not too good, and not all the time. 
(George Orwell) 
14. A man should look for what is, and not for what he thinks should be. 
(Albert Einstein) 


2.3 Conditional Statements 


There is yet another way to combine two statements. Suppose we have in 
mind a specific integer a. Consider the following statement about a. 


R: If the integer a is a multiple of 6, then a is divisible by 2. 


We immediately spot this as a true statement based on our knowledge of 
integers and the meanings of the words “if” and “then.” If integer a is a 
multiple of 6, then a is even, and therefore a is divisible by 2. Notice that R is 
built up from two simpler statements: 


P: The integer a is a multiple of 6. 
Q: The integer a is divisible by 2. 
R: If P, then Q. 


In general, given any two statements P and Q whatsoever, we can form 
the new statement “If P, then Q.” This is written symbolically as P ⇒ Q, 
which we read as “If P, then Q,” or “P implies Q.” Like ∧ and ∨, the symbol 
⇒ has a very specific meaning. When we assert that the statement P ⇒ Q 
is true, we mean that if P is true then Q must also be true. (In other words 
we mean that the condition P being true forces Q to be true.) A statement 
of form P ⇒ Q is called a conditional statement because it means Q will 
be true under the condition that P is true. 




Think of P ⇒ Q as a promise that whenever P is true, Q will be true also. 
There is only one way this promise can be broken (i.e., be false), namely if P 
is true but Q is false. So the truth table for the promise P ⇒ Q is as follows: 


P | Q | P ⇒ Q 
T | T |   T 
T | F |   F 
F | T |   T 
F | F |   T 


Perhaps you are bothered by how P ⇒ Q is true in the last two lines. Here 
is an example to explain it. Suppose your professor makes this promise: 


If you pass the final exam, then you will pass the course. 
Your professor is making the promise 
(You pass the exam) ⇒ (You pass the course). 


Under what circumstances did she lie? There are four possible scenarios, 
depending on whether or not you passed the exam and whether or not you 
passed the course. These scenarios are tallied in the following table. 


You pass exam | You pass course | (You pass exam) ⇒ (You pass course) 
      T       |        T        |                  T 
      T       |        F        |                  F 
      F       |        T        |                  T 
      F       |        F        |                  T 


The first row is the scenario in which you pass the exam and you pass 
the course. Clearly the professor kept her promise, so the T in the third 
column indicates that she told the truth. In the second row, you passed the 
exam but failed the course. In this case your professor broke her promise, 
and the F in the third column indicates that what she said was untrue. 

The third row describes the scenario in which you failed the exam but 
still passed the course. How could that happen? Maybe your professor felt 
sorry for you. But that doesn’t make her a liar. Her only promise was that 
if you passed the exam then you would pass the course. She did not say 
passing the exam was the only way to pass the course. Since she didn’t lie, 
then she told the truth, so there is a T in the third column. 

Finally look at the fourth row: you failed the exam and failed the course. 
Your professor certainly did not lie to you. Hence the T in the third column. 
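The four scenarios can be tabulated in code. In the Python sketch below, the function name implies is hypothetical (Python has no built-in operator for ⇒). It encodes the single way the promise can be broken: the first part true and the second part false.

```python
from itertools import product

def implies(p, q):
    """P ⇒ Q is false only when P is true and Q is false."""
    return not (p and not q)

# Rows in the same order as the table: TT, TF, FT, FF.
for passed_exam, passed_course in product([True, False], repeat=2):
    print(passed_exam, passed_course, implies(passed_exam, passed_course))
```

The only row ending in False is the second one, where the exam was passed but the course was not.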




For another example, consider this statement: 
If this month is September, then there is an equinox this month. 


An equinox is a day for which there are equal hours of darkness and light. 
There are two equinoxes per year, one in September and the other in March. 
The above statement is thus unquestionably true, for it asserts correctly 
that if the current month is September, then an equinox will occur this 
month. In symbolic form, our statement is 


(This month is September) ⇒ (There is an equinox this month). 


This statement is true, but the open sentences P: “This month is September,” 
and Q: “There is an equinox this month,” are either true or false, depending 
on what month it is. But P ⇒ Q is always true. This is shown below for 
three (out of 12) months. Notice how P ⇒ Q is true, even when P is false. 


       | P: This month is September | Q: There is an equinox this month | P ⇒ Q 
Sept.  |             T              |                 T                 |   T 
March  |             F              |                 T                 |   T 
May    |             F              |                 F                 |   T 


As P ⇒ Q is a true statement in this particular example, there is no month 
with P true and Q false. (Unless we imagine that Earth is destroyed by an 
asteroid before September 21, a possibility that we shall not entertain.) 
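The table above covers three months; a short program can cover all twelve. This Python sketch is an illustration only. The names MONTHS, EQUINOX_MONTHS and implies are hypothetical, and the equinox months are taken from the text (March and September).

```python
MONTHS = ["January", "February", "March", "April", "May", "June",
          "July", "August", "September", "October", "November", "December"]
EQUINOX_MONTHS = {"March", "September"}   # as stated in the text

def implies(p, q):
    """P ⇒ Q: false only when P is true and Q is false."""
    return (not p) or q

rows = []
for month in MONTHS:
    p = (month == "September")        # P: this month is September
    q = (month in EQUINOX_MONTHS)     # Q: there is an equinox this month
    rows.append((month, p, q, implies(p, q)))

for month, p, q, value in rows:
    print(f"{month:<10} {p!s:<6} {q!s:<6} {value}")

print(all(value for _, _, _, value in rows))   # True
```

P and Q change from month to month, but the last column is True in every row.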


In mathematics, whenever we encounter the construction “If P, then Q,” 
it means exactly what the truth table for ⇒ expresses. Of course there are 
other grammatical constructions that also mean P ⇒ Q. Here is a summary 
of the main ones. The meaning of each is encapsulated by the table for ⇒. 


If P, then Q. 
Q if P. 
Q whenever P. 
Q, provided that P. 
Whenever P, then also Q. 
P is a sufficient condition for Q. 
For Q, it is sufficient that P. 
Q is a necessary condition for P. 
For P, it is necessary that Q. 
P only if Q. 


P | Q | P ⇒ Q 
T | T |   T 
T | F |   F 
F | T |   T 
F | F |   T 




These can all be used in the place of (and mean exactly the same thing 
as) “If P, then Q.” You should analyze the meaning of each one and convince 
yourself that it captures the meaning of P ⇒ Q. For example, P ⇒ Q means 
the condition of P being true is enough (i.e., sufficient) to make Q true; 
hence “P is a sufficient condition for Q.” 

The wording can be tricky. An everyday situation may help clarify it. 
For example, consider your professor’s promise: 


(You pass the exam) ⇒ (You pass the course). 


This means that your passing the exam is a sufficient (though perhaps not 
necessary) condition for your passing the course. Thus your professor might 
just as well have phrased her promise in one of the following ways. 


Passing the exam is a sufficient condition for passing the course. 


For you to pass the course, it is sufficient that you pass the exam. 


However, when we want to say “If P, then Q” in everyday conversation, 
we do not normally express this as “Q is a necessary condition for P” or 
“P only if Q.” But such constructions are not uncommon in mathematics. 
To understand why they make sense, notice that P ⇒ Q being true means 
that it’s impossible that P is true but Q is false, so in order for P to be true 
it is necessary that Q is true; hence “Q is a necessary condition for P.” And 
this means that P can only be true if Q is true, i.e., “P only if Q.” 


Exercises for Section 2.3 

Without changing their meanings, convert each of the following sentences into a 
sentence having the form “If P, then Q.” 

1. A matrix is invertible provided that its determinant is not zero. 
2. For a function to be continuous, it is sufficient that it is differentiable. 
3. For a function to be continuous, it is necessary that it is integrable. 
4. A function is rational if it is a polynomial. 
5. An integer is divisible by 8 only if it is divisible by 4. 
6. Whenever a surface has only one side, it is non-orientable. 
7. A series converges whenever it converges absolutely. 
8. A geometric series with ratio r converges if |r| < 1. 
9. A function is integrable provided the function is continuous. 
10. The discriminant is negative only if the quadratic equation has no real solutions. 
11. You fail only if you stop writing. (Ray Bradbury) 




12. People will generally accept facts as truth only if the facts agree with what they 
already believe. (Andy Rooney) 


13. Whenever people agree with me I feel I must be wrong. (Oscar Wilde) 


2.4 Biconditional Statements 


It is important to understand that P ⇒ Q is not the same as Q ⇒ P. To see 
why, suppose that a is some integer and consider the statements 


(a is a multiple of 6) ⇒ (a is divisible by 2), 
(a is divisible by 2) ⇒ (a is a multiple of 6). 


The first statement asserts that if a is a multiple of 6 then a is divisible 
by 2. This is clearly true, for any multiple of 6 is even and therefore divisible 
by 2. The second statement asserts that if a is divisible by 2 then it is a 
multiple of 6. This is not necessarily true, for a = 4 (for instance) is divisible 
by 2, yet not a multiple of 6. Therefore the meanings of P ⇒ Q and Q ⇒ P 
are in general quite different. The conditional statement Q ⇒ P is called 
the converse of P ⇒ Q, so a conditional statement and its converse express 
entirely different things. 
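The difference between a conditional statement and its converse shows up immediately when both are tested on a sample of integers. In this Python sketch the names implies, forward and converse are hypothetical, and the sample 0, 1, ..., 30 is an arbitrary choice. Passing a finite check is not a proof, but a single failure is a genuine counterexample.

```python
def implies(p, q):
    """P ⇒ Q: false only when P is true and Q is false."""
    return (not p) or q

def forward(a):
    """(a is a multiple of 6) ⇒ (a is divisible by 2)."""
    return implies(a % 6 == 0, a % 2 == 0)

def converse(a):
    """(a is divisible by 2) ⇒ (a is a multiple of 6)."""
    return implies(a % 2 == 0, a % 6 == 0)

sample = range(0, 31)
print(all(forward(a) for a in sample))          # True
counterexamples = [a for a in sample if not converse(a)]
print(counterexamples[:5])                      # [2, 4, 8, 10, 14]
```

The value a = 4 mentioned above appears among the integers for which the converse fails.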

But sometimes, if P and Q are just the right statements, it can happen 
that P ⇒ Q and Q ⇒ P are both necessarily true. For example, consider the 
statements 


(a is even) ⇒ (a is divisible by 2), 
(a is divisible by 2) ⇒ (a is even). 


No matter what value a has, both of these statements are true. Since both 
P ⇒ Q and Q ⇒ P are true, it follows that (P ⇒ Q) ∧ (Q ⇒ P) is true. 

Let’s introduce a new symbol ⇔ to express the meaning of the statement 
(P ⇒ Q) ∧ (Q ⇒ P). The expression P ⇔ Q is understood to have exactly the 
same meaning as (P ⇒ Q) ∧ (Q ⇒ P). According to the previous section, Q ⇒ P 
is read as “P if Q,” and P ⇒ Q can be read as “P only if Q.” Therefore we 
pronounce P ⇔ Q as “P if and only if Q.” For example, given an integer a, 
we have the true statement 


(a is even) ⇔ (a is divisible by 2), 


which we can read as “The integer a is even if and only if a is divisible by 2.” 




The truth table for ⇔ is shown below. Notice that in the first and last 
rows, both P ⇒ Q and Q ⇒ P are true (according to the truth table for ⇒), so 
(P ⇒ Q) ∧ (Q ⇒ P) is true, and hence P ⇔ Q is true. However, in the middle 
two rows one of P ⇒ Q or Q ⇒ P is false, so (P ⇒ Q) ∧ (Q ⇒ P) is false, making 
P ⇔ Q false. 


P | Q | P ⇔ Q 
T | T |   T 
T | F |   F 
F | T |   F 
F | F |   T 


Compare the statement R : (a is even) ⇔ (a is divisible by 2) with this 
truth table. If a is even then the two statements on either side of ⇔ are 
true, so according to the table R is true. If a is odd then the two statements 
on either side of ⇔ are false, and again according to the table R is true. 
Thus R is true no matter what value a has. In general, P ⇔ Q being true 
means P and Q are both true or both false. 
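The description of ⇔ as a pair of conditional statements can be checked directly. In the Python sketch below the names implies, iff, is_even and divisible_by_2 are hypothetical. To keep the two sides of R from being literally the same test, is_even is assumed here to mean “the last decimal digit is 0, 2, 4, 6 or 8,” while divisible_by_2 uses the remainder on division by 2.

```python
from itertools import product

def implies(p, q):
    """P ⇒ Q: false only when P is true and Q is false."""
    return (not p) or q

def iff(p, q):
    """P ⇔ Q, defined as (P ⇒ Q) ∧ (Q ⇒ P)."""
    return implies(p, q) and implies(q, p)

# Truth table rows in the order TT, TF, FT, FF.
for p, q in product([True, False], repeat=2):
    print(p, q, iff(p, q))

def is_even(a):
    """Even, judged by the last decimal digit (an assumption for illustration)."""
    return str(abs(a))[-1] in "02468"

def divisible_by_2(a):
    return a % 2 == 0

print(all(iff(is_even(a), divisible_by_2(a)) for a in range(-50, 51)))   # True
```

The table shows that iff(p, q) is True exactly when p and q have the same truth value.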

Not surprisingly, there are many ways of saying P ⇔ Q in English. The
following constructions all mean P ⇔ Q:


P if and only if Q.
P is a necessary and sufficient condition for Q.
For P it is necessary and sufficient that Q.
P is equivalent to Q.
If P, then Q, and conversely.


The first three of these just combine constructions from the previous section
to express that P ⇒ Q and Q ⇒ P. In the last one, the words “...and conversely”
mean that in addition to “If P, then Q” being true, the converse statement 
“If Q, then P” is also true. 


Exercises for Section 2.4 

Without changing their meanings, convert each of the following sentences into a 
sentence having the form “P if and only if Q.” 

1. For matrix A to be invertible, it is necessary and sufficient that det(A) ≠ 0.

2. If a function has a constant derivative then it is linear, and conversely.

3. If xy = 0 then x = 0 or y = 0, and conversely.

4. If a ∈ ℚ then 5a ∈ ℚ, and if 5a ∈ ℚ then a ∈ ℚ.

5. For an occurrence to become an adventure, it is necessary and sufficient for one
to recount it. (Jean-Paul Sartre)




2.5 Truth Tables for Statements 


You should now know the truth tables for ∧, ∨, ~, ⇒ and ⇔. They should
be internalized as well as memorized. You must understand the symbols
thoroughly, for we now combine them to form more complex statements.
For example, suppose we want to convey that one or the other of P and Q
is true but they are not both true. No single symbol expresses this, but we
could combine them as

(P ∨ Q) ∧ ~(P ∧ Q),


which literally means: 
P or Q is true, and it is not the case that both P and Q are true. 


This statement will be true or false depending on the truth values of P and Q. 
In fact we can make a truth table for the entire statement. Begin as usual 
by listing the possible true/false combinations of P and Q on four lines. The 
statement (P ∨ Q) ∧ ~(P ∧ Q) contains the individual statements (P ∨ Q) and
(P ∧ Q), so we next tally their truth values in the third and fourth columns.
The fifth column lists values for ~(P ∧ Q), and these are just the opposites
of the corresponding entries in the fourth column. Finally, combining the
third and fifth columns with ∧, we get the values for (P ∨ Q) ∧ ~(P ∧ Q) in
the sixth column.


P | Q | (P ∨ Q) | (P ∧ Q) | ~(P ∧ Q) | (P ∨ Q) ∧ ~(P ∧ Q)
T | T |    T    |    T    |    F     |         F
T | F |    T    |    F    |    T     |         T
F | T |    T    |    F    |    T     |         T
F | F |    F    |    F    |    T     |         F


This truth table tells us that (P ∨ Q) ∧ ~(P ∧ Q) is true precisely when one
but not both of P and Q are true, so it has the meaning we intended. (Notice 
that the middle three columns of our truth table are just “helper columns” 
and are not necessary parts of the table. In writing truth tables, you may 
choose to omit such columns if you are confident about your work.) 
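
The column-by-column procedure just described is easy to automate. The sketch below is one possible way to do it in Python; the function names one_but_not_both and truth_table are hypothetical and chosen only for this illustration. It evaluates (P ∨ Q) ∧ ~(P ∧ Q) on each of the four lines, omitting the helper columns.

```python
from itertools import product

def one_but_not_both(p, q):
    """Truth value of (P or Q) and not (P and Q)."""
    return (p or q) and not (p and q)

def truth_table():
    """List the rows (P, Q, value) in the order TT, TF, FT, FF."""
    return [(p, q, one_but_not_both(p, q))
            for p, q in product([True, False], repeat=2)]

# Print each row using T and F, as in the tables of this section.
for row in truth_table():
    print(*("T" if value else "F" for value in row))
```

The final column comes out as F, T, T, F, matching the sixth column of the table above. In Python the same values are produced by the comparison p != q on truth values.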

For another example, consider the following familiar statement about 
real numbers x and y: 


The product xy equals zero if and only if x = 0 or y=0. 


This can be modeled as (xy = 0) ⇔ (x = 0 ∨ y = 0). If we introduce letters P, Q
and R for the statements xy = 0, x = 0 and y = 0, it becomes P ⇔ (Q ∨ R). Notice
that the parentheses are necessary here, for without them we wouldn’t know
whether to read the statement as P ⇔ (Q ∨ R) or (P ⇔ Q) ∨ R.




Making a truth table for P ⇔ (Q ∨ R) entails a line for each T/F combination
for the three statements P, Q and R. The eight possible combinations
are tallied in the first three columns of the following table.


P | Q | R | Q ∨ R | P ⇔ (Q ∨ R)
T | T | T |   T   |      T
T | T | F |   T   |      T
T | F | T |   T   |      T
T | F | F |   F   |      F
F | T | T |   T   |      F
F | T | F |   T   |      F
F | F | T |   T   |      F
F | F | F |   F   |      T


We fill in the fourth column using our knowledge of the truth table for v. 
Finally the fifth column is filled in by combining the first and fourth columns 
with our understanding of the truth table for ⇔. The resulting table gives
the true/false values of P ⇔ (Q ∨ R) for all values of P, Q and R.

Notice that when we plug in various values for x and y, the statements 
P : xy = 0, Q : x = 0 and R : y = 0 have various truth values, but the statement
P ⇔ (Q ∨ R) is always true. For example, if x = 2 and y = 8, then P, Q and
R are all false. This scenario is described in the last row of the table, and
there we see that P ⇔ (Q ∨ R) is true. Likewise if x = 0 and y = 7, then P
and Q are true and R is false, a scenario described in the second line of
the table, where again P ⇔ (Q ∨ R) is true. There is a simple reason why
P ⇔ (Q ∨ R) is true for any values of x and y: It is that P ⇔ (Q ∨ R) represents
(xy = 0) ⇔ (x = 0 ∨ y = 0), which is a true mathematical statement. It is
absolutely impossible for it to be false. 
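
To see the difference between the abstract form P ⇔ (Q ∨ R) and the particular statement about x and y, it may help to compute both. The Python sketch below is an illustration under our own naming (statement, table and from_numbers are hypothetical helpers). It tabulates the form over all eight lines, then evaluates it when P, Q and R are the statements xy = 0, x = 0 and y = 0.

```python
from itertools import product

def statement(p, q, r):
    """Truth value of P <=> (Q or R)."""
    return p == (q or r)

def table():
    """All eight rows (P, Q, R, value), starting with TTT and ending with FFF."""
    return [(p, q, r, statement(p, q, r))
            for p, q, r in product([True, False], repeat=3)]

def from_numbers(x, y):
    """Value of the form when P is xy = 0, Q is x = 0 and R is y = 0."""
    return statement(x * y == 0, x == 0, y == 0)

for row in table():
    print(*("T" if value else "F" for value in row))

# Try a small grid of integer values for x and y.
print(all(from_numbers(x, y) for x in range(-5, 6) for y in range(-5, 6)))
```

The table contains lines where the form is false, yet from_numbers never lands on one of them. A finite grid of integers is of course not a proof for all real numbers; it only illustrates that the values of x and y never produce the false lines.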

This may make you wonder about the lines in the table where P ⇔ (Q ∨ R)
is false. Why are they there? The reason is that P ⇔ (Q ∨ R) can also
represent a false statement. To see how, imagine that at the end of the 
semester your professor makes the following promise. 


You pass the class if and only if you get an “A” on the final or you get a 
“B” on the final. 


This promise has the form P ⇔ (Q ∨ R), so its truth values are tabulated in
the above table. Imagine it turned out that you got an “A” on the exam but
failed the course. Then surely your professor lied to you. In fact, P is false,
Q is true and R is false. This scenario is reflected in the sixth line of the
table, and indeed P ⇔ (Q ∨ R) is false (i.e., it is a lie).




The moral of this example is that people can lie, but true mathematical 
statements never lie. 

We close this section with a word about the use of parentheses. The 
symbol ~ is analogous to the minus sign in algebra. It negates the expression 
it precedes. Thus ~P ∨ Q means (~P) ∨ Q, not ~(P ∨ Q). In ~(P ∨ Q), the
value of the entire expression P ∨ Q is negated.


Exercises for Section 2.5 


Write a truth table for the logical statements in problems 1-9: 


1. P ∨ (Q ⇒ R)
2. (Q ∨ R) ⇔ (R ∧ Q)
3. ~(P ⇒ Q)
4. ~(P ∨ Q) ∨ (~P)
5. (P ∧ ~P) ∨ Q
6. (P ∧ ~P) ∧ Q
7. (P ∧ ~P) ⇒ Q
8. P ∨ (Q ∧ ~R)
9. ~(~P ∨ ~Q)

10. Suppose the statement ((P ∧ Q) ∨ R) ⇒ (R ∨ S) is false. Find the truth values of
P, Q, R and S. (This can be done without a truth table.)

11. Suppose P is false and that the statement (R ⇒ S) ⇔ (P ∧ Q) is true. Find the
truth values of R and S. (This can be done without a truth table.)


2.6 Logical Equivalence 


In contemplating the truth table for P ⇔ Q, you probably noticed that P ⇔ Q
is true exactly when P and Q are both true or both false. In other words,
P ⇔ Q is true precisely when at least one of the statements P ∧ Q or ~P ∧ ~Q
is true. This may tempt us to say that P ⇔ Q means the same thing as
(P ∧ Q) ∨ (~P ∧ ~Q).

To see if this is really so, we can write truth tables for P ⇔ Q and
(P ∧ Q) ∨ (~P ∧ ~Q). In doing this, it is more efficient to put these two
statements into the same table, as follows. (This table has helper columns
for the intermediate expressions ~P, ~Q, (P ∧ Q) and (~P ∧ ~Q).)


P | Q | ~P | ~Q | (P ∧ Q) | (~P ∧ ~Q) | (P ∧ Q) ∨ (~P ∧ ~Q) | P ⇔ Q
T | T | F  | F  |    T    |     F     |          T          |   T
T | F | F  | T  |    F    |     F     |          F          |   F
F | T | T  | F  |    F    |     F     |          F          |   F
F | F | T  | T  |    F    |     T     |          T          |   T


The table shows that P ⇔ Q and (P ∧ Q) ∨ (~P ∧ ~Q) have the same truth
value, no matter the values of P and Q. It is as if P ⇔ Q and (P ∧ Q) ∨ (~P ∧ ~Q)
are algebraic expressions that are equal no matter what is “plugged into”




variables P and Q. We express this state of affairs by writing

P ⇔ Q = (P ∧ Q) ∨ (~P ∧ ~Q)


and saying that P ⇔ Q and (P ∧ Q) ∨ (~P ∧ ~Q) are logically equivalent.
In general, two statements are logically equivalent if their truth
values match up line-for-line in a truth table.
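
The phrase “match up line-for-line” translates directly into a small program. The following Python sketch is one possible rendering of the definition; the name logically_equivalent and the two example formulas are our own and are used only for illustration. It compares two statements on every line of their common truth table.

```python
from itertools import product

def logically_equivalent(f, g, num_vars):
    """Return True if f and g agree on every row of the truth table."""
    return all(f(*row) == g(*row)
               for row in product([True, False], repeat=num_vars))

def implies(p, q):
    """Truth value of P => Q."""
    return (not p) or q

def biconditional(p, q):
    """P <=> Q, written as (P => Q) and (Q => P)."""
    return implies(p, q) and implies(q, p)

def both_or_neither(p, q):
    """(P and Q) or (not P and not Q)."""
    return (p and q) or ((not p) and (not q))

print(logically_equivalent(biconditional, both_or_neither, 2))
```

The same function reports that P ⇒ Q and its converse Q ⇒ P are not logically equivalent, since their tables differ on the two middle lines.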
Logical equivalence is important because it can give us different (and 
potentially useful) ways of looking at the same thing. As an example, the 
following table shows that P ⇒ Q is logically equivalent to (~Q) ⇒ (~P).


P | Q | ~P | ~Q | (~Q) ⇒ (~P) | P ⇒ Q
T | T | F  | F  |      T      |   T
T | F | F  | T  |      F      |   F
F | T | T  | F  |      T      |   T
F | F | T  | T  |      T      |   T


The fact P ⇒ Q = (~Q) ⇒ (~P) is useful because so many theorems have
the form P ⇒ Q. As we will see in Chapter 5, proving such a theorem may
be easier if we express it in the logically equivalent form (~Q) ⇒ (~P).

Two pairs of logically equivalent statements in particular are significant 
enough to have a special name: DeMorgan’s laws. 


Fact 2.1 (DeMorgan’s Laws) 
1. ~(P ∧ Q) = (~P) ∨ (~Q)
2. ~(P ∨ Q) = (~P) ∧ (~Q)


The first of DeMorgan’s laws is verified by the following table. You are 
asked to verify the second in one of the exercises. 


P | Q | ~P | ~Q | P ∧ Q | ~(P ∧ Q) | (~P) ∨ (~Q)
T | T | F  | F  |   T   |    F     |      F
T | F | F  | T  |   F   |    T     |      T
F | T | T  | F  |   F   |    T     |      T
F | F | T  | T  |   F   |    T     |      T


DeMorgan’s laws are actually very natural and intuitive. Consider the 
statement ~(P ∧ Q), which we can interpret as meaning that it is not the
case that both P and Q are true. If it is not the case that both P and Q are
true, then at least one of P or Q is false, in which case (~P) ∨ (~Q) is true.
Thus ~(P ∧ Q) means the same thing as (~P) ∨ (~Q).




DeMorgan’s laws can be very useful. Suppose we happen to know that 
some statement of form ~(P ∨ Q) is true. The second of DeMorgan’s laws
tells us that (~P) ∧ (~Q) is also true, hence ~P and ~Q are both true as
well. Quickly obtaining this additional information can be extremely useful. 

Here is a summary of some significant logical equivalences. Those that 
are not immediately obvious can be verified with truth tables. 


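P ⇒ Q = (~Q) ⇒ (~P)                      Contrapositive law   (2.1)

~(P ∧ Q) = ~P ∨ ~Q
~(P ∨ Q) = ~P ∧ ~Q                       DeMorgan’s laws      (2.2)

P ∧ Q = Q ∧ P
P ∨ Q = Q ∨ P                            Commutative laws     (2.3)

P ∧ (Q ∨ R) = (P ∧ Q) ∨ (P ∧ R)
P ∨ (Q ∧ R) = (P ∨ Q) ∧ (P ∨ R)          Distributive laws    (2.4)

P ∧ (Q ∧ R) = (P ∧ Q) ∧ R
P ∨ (Q ∨ R) = (P ∨ Q) ∨ R                Associative laws     (2.5)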


Notice how the distributive law P ∧ (Q ∨ R) = (P ∧ Q) ∨ (P ∧ R) has the
same structure as the distributive law p·(q + r) = p·q + p·r from algebra.
Concerning the associative laws, the fact that P ∧ (Q ∧ R) = (P ∧ Q) ∧ R means
that the position of the parentheses is irrelevant, and we can write this as
P ∧ Q ∧ R without ambiguity. Similarly, we may drop the parentheses in an
expression such as P ∨ (Q ∨ R).

But parentheses are essential when there is a mix of ∧ and ∨, as in
P ∨ (Q ∧ R). Indeed, P ∨ (Q ∧ R) and (P ∨ Q) ∧ R are not logically equivalent.
(See Exercise 13 for Section 2.6, below.) 
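
The role of parentheses can be explored by machine as well as by hand. The Python sketch below is only an illustration; the helpers same_table and differing_rows are hypothetical names introduced here. It confirms one distributive law over all eight lines and lists the lines on which P ∨ (Q ∧ R) and (P ∨ Q) ∧ R disagree.

```python
from itertools import product

# The eight (P, Q, R) combinations, from TTT down to FFF.
ROWS = list(product([True, False], repeat=3))

def same_table(f, g):
    """True if f and g have matching truth values on every row."""
    return all(f(p, q, r) == g(p, q, r) for p, q, r in ROWS)

def differing_rows(f, g):
    """The rows (P, Q, R) on which f and g take different truth values."""
    return [(p, q, r) for p, q, r in ROWS if f(p, q, r) != g(p, q, r)]

distributive_ok = same_table(lambda p, q, r: p and (q or r),
                             lambda p, q, r: (p and q) or (p and r))

mixed = differing_rows(lambda p, q, r: p or (q and r),
                       lambda p, q, r: (p or q) and r)

print(distributive_ok)
print(mixed)
```

A single differing line is enough to show that two statements are not logically equivalent, so the list printed last settles the matter as soon as it is non-empty.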


Exercises for Section 2.6 


A. Use truth tables to show that the following statements are logically equivalent. 


1. P ∧ (Q ∨ R) = (P ∧ Q) ∨ (P ∧ R)
2. P ∨ (Q ∧ R) = (P ∨ Q) ∧ (P ∨ R)
3. P ⇒ Q = (~P) ∨ Q
4. ~(P ∨ Q) = (~P) ∧ (~Q)
5. ~(P ∨ Q ∨ R) = (~P) ∧ (~Q) ∧ (~R)
6. ~(P ∧ Q ∧ R) = (~P) ∨ (~Q) ∨ (~R)
7. P ⇒ Q = (P ∧ ~Q) ⇒ (Q ∧ ~Q)
8. ~P ⇔ Q = (P ⇒ ~Q) ∧ (~Q ⇒ P)


B. Decide whether or not the following pairs of statements are logically equivalent.

9. P ∧ Q and ~(~P ∨ ~Q)
10. (P ⇒ Q) ∨ R and ~((P ∧ ~Q) ∧ ~R)
11. (~P) ∧ (P ⇒ Q) and ~(Q ⇒ P)
12. ~(P ⇒ Q) and P ∧ ~Q
13. P ∨ (Q ∧ R) and (P ∨ Q) ∧ R
14. P ∧ (Q ∨ ~Q) and (~P) ⇒ (Q ∧ ~Q)




2.7 Quantifiers 


Using symbols ∧, ∨, ~, ⇒ and ⇔, we can deconstruct many English sentences
into a symbolic form. As we have seen, this symbolic form can help us 
understand the logical structure of sentences and how different sentences 
may actually have the same meaning (as in logical equivalence). 

But these symbols alone are not powerful enough to capture the full 
meaning of every statement. To see why, imagine that we are dealing with 
an infinite set X = {x₁, x₂, x₃, ...} of integers. Suppose we want to express
the statement “Every element of X is odd.” We would have to write


P(x₁) ∧ P(x₂) ∧ P(x₃) ∧ P(x₄) ∧ ···,


where P(x) is the open sentence “x is odd.” And if we wanted to express
“There is at least one element of X that is odd,” we’d have to write


P(x₁) ∨ P(x₂) ∨ P(x₃) ∨ P(x₄) ∨ ···.


The problem is that these expressions trail on forever.

To overcome this defect, we will introduce two new symbols ∀ and ∃. The
symbol ∀ stands for the phrase “for all” and ∃ stands for “there exists.” Thus
the statement “Every element of X is odd.” is written symbolically as


∀x ∈ X, P(x),

and “There is at least one element of X that is odd,” is written succinctly as

∃x ∈ X, P(x).


These new symbols are called quantifiers.


Definition 2.1 The symbols ∀ and ∃ are called quantifiers.
∀ stands for the phrase “For all” or “For every,” or “For each,”
∃ stands for the phrase “There exists a” or “There is a.”


Thus the statement

For every n ∈ ℤ, 2n is even,

can be expressed in either of the following ways:


∀n ∈ ℤ, 2n is even,


∀n ∈ ℤ, E(2n) (where E(x) means x is even).




Likewise, a statement such as 
There exists a subset X of ℕ for which |X| = 5.
can be translated as
∃X, (X ⊆ ℕ) ∧ (|X| = 5)   or   ∃X ⊆ ℕ, |X| = 5   or   ∃X ∈ 𝒫(ℕ), |X| = 5.


The symbols ∀ and ∃ are called quantifiers because they refer in some
sense to the quantity (i.e., all or some) of the variable that follows them. The
symbol ∀ is called the universal quantifier and ∃ is called the existential
quantifier. Statements containing them are called quantified statements.
A statement beginning with ∀ is called a universally quantified statement,
and one beginning with ∃ is called an existentially quantified
statement.


Example 2.5 The following English statements are paired with their 
translations into symbolic form. 


Every integer that is not odd is even. 
∀n ∈ ℤ, ~(n is odd) ⇒ (n is even),   or   ∀n ∈ ℤ, ~O(n) ⇒ E(n).


There is an integer that is not even.
∃n ∈ ℤ, ~E(n).


For every real number x, there is a real number y for which y³ = x.
∀x ∈ ℝ, ∃y ∈ ℝ, y³ = x.


Given any two rational numbers a and b, the product ab is rational.
∀a, b ∈ ℚ, ab ∈ ℚ.


Given a set X (such as, but not limited to, ℕ, ℤ, ℚ, etc.), a quantified
statement of form ∀x ∈ X, P(x) is understood to be true if P(x) is true for every
x ∈ X. If there is at least one x ∈ X for which P(x) is false, then ∀x ∈ X, P(x)
is a false statement. Similarly, ∃x ∈ X, P(x) is true provided that P(x) is true
for at least one element x ∈ X; otherwise it is false. Thus each statement in
Example 2.5 is true. Here are some examples of quantified statements that 
are false: 


Example 2.6 The following false quantified statements are paired with 
their translations. 


Every integer is even. 
∀n ∈ ℤ, E(n).




There is an integer n for which n² = 2.
∃n ∈ ℤ, n² = 2.


For every real number x, there is a real number y for which y² = x.
∀x ∈ ℝ, ∃y ∈ ℝ, y² = x.


Given any two rational numbers a and b, the number √(ab) is rational.
∀a, b ∈ ℚ, √(ab) ∈ ℚ.
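
The truth conditions for ∀ and ∃ resemble Python’s built-in functions all and any when the set in question is finite. The sketch below is an illustration only. The helpers for_all and there_exists are our own names, and the finite range X is an assumed stand-in for ℤ, so a check over X can refute a universal statement but can never prove one about all integers.

```python
def for_all(domain, predicate):
    """Finite analogue of: for all x in the domain, P(x)."""
    return all(predicate(x) for x in domain)

def there_exists(domain, predicate):
    """Finite analogue of: there exists x in the domain with P(x)."""
    return any(predicate(x) for x in domain)

def is_even(n):
    return n % 2 == 0

# Hypothetical finite sample of integers standing in for Z.
X = range(-10, 11)

print(for_all(X, lambda n: is_even(2 * n)))        # 2n is even for each n in X
print(for_all(X, is_even))                         # fails as soon as an odd n appears
print(there_exists(X, lambda n: not is_even(n)))   # some n in X is not even
print(there_exists(X, lambda n: n * n == 2))       # no n in X has square 2
```

Note how one odd integer is enough to make the universally quantified check false, while the existential check needs only one witness to be true.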


Example 2.7 When a statement contains two quantifiers you must be 
very alert to their order, for reversing the order can change the meaning. 
Consider the following statement from Example 2.5. 


∀x ∈ ℝ, ∃y ∈ ℝ, y³ = x.


This statement is true, for no matter what number x is there exists a number
y = ∛x for which y³ = x. Now reverse the order of the quantifiers to get the
new statement


∃y ∈ ℝ, ∀x ∈ ℝ, y³ = x.


This new statement says that there exists a particular number y with the
property that y³ = x for every real number x. Since no number y can have
this property, the statement is false. The two statements above have entirely 
different meanings. 
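
The effect of swapping quantifiers can be seen on a small finite set, where both orders can be checked exhaustively. The Python sketch below uses a different and simpler relation than the example above, namely x + y = 0 on the integers from −3 to 3; this choice, and the names forall_exists and exists_forall, are our own assumptions made so that the check is finite.

```python
# Hypothetical finite domain: the integers -3, -2, ..., 3.
D = range(-3, 4)

def forall_exists(domain, relation):
    """For every x there is some y (possibly depending on x) with relation(x, y)."""
    return all(any(relation(x, y) for y in domain) for x in domain)

def exists_forall(domain, relation):
    """There is a single y that works for every x."""
    return any(all(relation(x, y) for x in domain) for y in domain)

def sums_to_zero(x, y):
    return x + y == 0

print(forall_exists(D, sums_to_zero))
print(exists_forall(D, sums_to_zero))
```

In the first check the value of y is allowed to change with x (take y = −x), whereas the second demands one y that serves all x at once, and no such y exists in D.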


Quantified statements are often misused in casual conversation. Maybe 
you've heard someone say “All students do not pay full tuition.” when they 
mean “Not all students pay full tuition.” While the mistake is perhaps 
marginally forgivable in casual conversation, it must never be made in a
mathematical context. Do not say “All integers are not even.” because that 
means there are no even integers. Instead, say “Not all integers are even.” 


Exercises for Section 2.7 


Write the following as English sentences. Say whether they are true or false. 


1. ∀x ∈ ℝ, x² > 0
2. ∀x ∈ ℝ, ∃n ∈ ℕ, xⁿ ≥ 0
3. ∃a ∈ ℝ, ∀x ∈ ℝ, ax = x
4. ∀X ∈ 𝒫(ℕ), X ⊆ ℝ
5. ∀n ∈ ℕ, ∃X ∈ 𝒫(ℕ), |X| < n
6. ∃n ∈ ℕ, ∀X ∈ 𝒫(ℕ), |X| < n
7. ∀X ⊆ ℕ, ∃n ∈ ℤ, |X| = n
8. ∀n ∈ ℤ, ∃X ⊆ ℕ, |X| = n
9. ∀n ∈ ℤ, ∃m ∈ ℤ, m = n + 5
10. ∃m ∈ ℤ, ∀n ∈ ℤ, m = n + 5




2.8 More on Conditional Statements 


It is time to address a very important point about conditional statements 
that contain variables. To motivate this, let’s return to the following example 
concerning integers x: 


(x is a multiple of 6) ⇒ (x is even).


As noted earlier, since every multiple of 6 is even, this is a true statement 
no matter what integer x is. We could even underscore this fact by writing 
this true statement as 


∀x ∈ ℤ, (x is a multiple of 6) ⇒ (x is even).

But now switch things around to get the different statement

(x is even) ⇒ (x is a multiple of 6).


This is true for some values of x such as −6, 12, 18, etc., but false for others
(such as 2, 4, etc.). Thus we do not have a statement, but rather an open 
sentence. (Recall from Section 2.1 that an open sentence is a sentence 
whose truth value depends on the value of a certain variable or variables.) 
However, by putting a universal quantifier in front we get 


∀x ∈ ℤ, (x is even) ⇒ (x is a multiple of 6),


which is definitely false, so this new expression is a statement, not an open 
sentence. In general, given any two open sentences P(x) and Q(x) about 
integers x, the expression ∀x ∈ ℤ, P(x) ⇒ Q(x) is either true or false, so it is
a statement, not an open sentence. 

Now we come to the very important point. In mathematics, whenever 
P(x) and Q(x) are open sentences concerning elements x in some set X 
(depending on context), an expression of form P(x) ⇒ Q(x) is understood to be
the statement ∀x ∈ X, P(x) ⇒ Q(x). In other words, if a conditional statement
is not explicitly quantified then there is an implied universal quantifier in
front of it. This is done because statements of the form ∀x ∈ X, P(x) ⇒ Q(x)
are so common in mathematics that we would get tired of putting the ∀x ∈ X
in front of them. 


Thus the following sentence is a true statement (as it is true for all x). 


If x is a multiple of 6, then x is even. 




Likewise, the next sentence is a false statement (as it is not true for all x). 
If x is even, then x is a multiple of 6. 


This leads to the following significant interpretation of a conditional 
statement, which is more general than (but consistent with) its definition 
in Section 2.3. 


Definition 2.2 If P and Q are statements or open sentences, then 
“If P, then Q,” 


is a statement. This statement is true if it’s impossible for P to be true 
while Q is false. It is false if there is at least one instance in which P is 
true but Q is false. 


Thus the following are true statements: 


If x ∈ ℝ, then x² + 1 > 0.


If a function f is differentiable on ℝ, then f is continuous on ℝ.

Likewise, the following are false statements:


If p is a prime number, then p is odd. (2 is prime.)


If f is a rational function, then f has an asymptote. (x² is rational.)
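
Definition 2.2 says that “If P, then Q” is false as soon as there is one instance with P true and Q false. For statements about integers, such instances can be hunted for by machine. The Python sketch below is an illustration under our own assumptions: the helper counterexamples and the finite sample of integers are hypothetical, and an empty result over a finite sample does not prove the statement for all integers.

```python
def counterexamples(domain, hypothesis, conclusion):
    """Elements for which the hypothesis is true but the conclusion is false."""
    return [x for x in domain if hypothesis(x) and not conclusion(x)]

def multiple_of_6(x):
    return x % 6 == 0

def even(x):
    return x % 2 == 0

# Hypothetical finite sample of integers.
sample = range(-12, 13)

# "If x is a multiple of 6, then x is even."
print(counterexamples(sample, multiple_of_6, even))

# "If x is even, then x is a multiple of 6."
print(counterexamples(sample, even, multiple_of_6))
```

The first search comes back empty, while the second returns integers such as 2 and 4, each of which shows that the second statement is false.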


2.9 Translating English to Symbolic Logic 


In writing (and reading) proofs of theorems, we must always be alert to the 
logical structure and meanings of the sentences. Sometimes it is necessary 
or helpful to parse them into expressions involving logic symbols. This may 
be done mentally or on scratch paper, or occasionally even explicitly within 
the body of a proof. The purpose of this section is to give you sufficient 
practice in translating English sentences into symbolic form so that you 
can better understand their logical structure. Here are some examples: 


Example 2.8 Consider the Mean Value Theorem from Calculus: 


If f is continuous on the interval [a,b] and differentiable on (a,b), then
there is a number c ∈ (a,b) for which f′(c) = (f(b) − f(a))/(b − a).


Here is a translation to symbolic form:


((f cont. on [a,b]) ∧ (f is diff. on (a,b))) ⇒ (∃c ∈ (a,b), f′(c) = (f(b) − f(a))/(b − a)).




Example 2.9 Consider Goldbach’s conjecture, from Section 2.1: 
Every even integer greater than 2 is the sum of two primes. 


This can be translated in the following ways, where P is the set of prime 
numbers and X = {4,6,8,10,...} is the set of even integers greater than 2. 


(n ∈ X) ⇒ (∃p, q ∈ P, n = p + q)
∀n ∈ X, ∃p, q ∈ P, n = p + q


These translations of Goldbach’s conjecture illustrate an important point. 
The first has the basic structure (n ∈ X) ⇒ Q(n) and the second has structure
∀n ∈ X, Q(n), yet they have exactly the same meaning. This is significant.
Every universally quantified statement can be expressed as a conditional 
statement. 


Fact 2.2 Suppose X is a set and Q(x) is a statement about x for each 
x ∈ X. The following statements mean the same thing:

∀x ∈ X, Q(x)

(x ∈ X) ⇒ Q(x).


This fact is significant because so many theorems have the form of 
a conditional statement. (The Mean Value Theorem is an example.) In 
proving a theorem we have to think carefully about what it says. Sometimes 
a theorem will be expressed as a universally quantified statement, but it will 
be more convenient to think of it as a conditional statement. Understanding 
the above fact allows us to switch between the two forms. 
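
The two forms in Fact 2.2 can be compared on a finite range using Goldbach’s conjecture as the test case. The Python sketch below is an illustration only; the function names and the bound 100 are our own assumptions, and a finite check says nothing about whether the conjecture holds for all even integers. It evaluates the quantified form over the even integers from 4 to 100 and the conditional form over all integers from 1 to 100.

```python
def is_prime(n):
    """Trial division; adequate for the small numbers used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def has_prime_sum(n):
    """Q(n): there exist primes p and q with n = p + q."""
    return any(is_prime(p) and is_prime(n - p) for p in range(2, n // 2 + 1))

def quantified_form(members_of_X):
    """For all n in X, Q(n)."""
    return all(has_prime_sum(n) for n in members_of_X)

def conditional_form(candidates, in_X):
    """(n in X) => Q(n), read with the implied universal quantifier."""
    return all((not in_X(n)) or has_prime_sum(n) for n in candidates)

def in_X(n):
    """Membership in X = {4, 6, 8, 10, ...}."""
    return n > 2 and n % 2 == 0

print(quantified_form(range(4, 101, 2)))
print(conditional_form(range(1, 101), in_X))
```

In the conditional form, integers outside X make the hypothesis false, so they satisfy the conditional automatically. This is why the two checks agree.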

The section closes with some final points. In translating a statement, 
be attentive to its intended meaning. Don’t jump into, for example,
automatically replacing every “and” with ∧ and “or” with ∨. An example:


At least one of the integers x and y is even. 


Don’t be led astray by the presence of the word “and.” The meaning of 
the statement is that one or both of the numbers is even, so it should be 
translated with “or,” not “and”: 


(x is even) ∨ (y is even).


Finally, the logical meaning of “but” can be captured by “and.” The 
sentence “The integer x is even, but the integer y is odd,” is translated as 


(x is even) ∧ (y is odd).




Exercises for Section 2.9 


Translate each of the following sentences into symbolic logic. 


1. If f is a polynomial and its degree is greater than 2, then f′ is not constant.

2. The number x is positive but the number y is not positive.

3. If x is prime, then √x is not a rational number.

4. For every prime number p there is another prime number q with q > p.

5. For every positive number ε, there is a positive number δ for which |x − a| < δ
implies |f(x) − f(a)| < ε.

6. For every positive number ε there is a positive number M for which |f(x) − b| < ε,
whenever x > M.

7. There exists a real number a for which a + x = x for every real number x.

8. I don’t eat anything that has a face.

9. If x is a rational number and x ≠ 0, then tan(x) is not a rational number.

10. If sin(x) < 0, then it is not the case that 0 ≤ x ≤ π.

11. There is a Providence that protects idiots, drunkards, children and the United
States of America. (Otto von Bismarck)

12. You can fool some of the people all of the time, and you can fool all of the people
some of the time, but you can’t fool all of the people all of the time. (Abraham
Lincoln)

13. Everything is funny as long as it is happening to somebody else. (Will Rogers)


2.10 Negating Statements 


Given a statement R, the statement ~ R is called the negation of R. If R is 
a complex statement, then it is often the case that its negation ~ R can be 
written in a simpler or more useful form. The process of finding this form 
is called negating R. In proving theorems it is often necessary to negate 
certain statements. We now investigate how to do this. 


We have already examined part of this topic. DeMorgan’s laws 


~(P ∧ Q) = (~P) ∨ (~Q)    (2.6)
~(P ∨ Q) = (~P) ∧ (~Q)    (2.7)


(from Section 2.6) can be viewed as rules that tell us how to negate the
statements P ∧ Q and P ∨ Q. Here are some examples that illustrate how
DeMorgan’s laws are used to negate statements involving “and” or “or.” 




Example 2.10 Consider negating the following statement. 
R: You can solve it by factoring or with the quadratic formula. 


Now, R means (You can solve it by factoring) ∨ (You can solve it with Q.F.),
which we will denote as P ∨ Q. The negation of this is

~(P ∨ Q) = (~P) ∧ (~Q).
Therefore, in words, the negation of R is 


~R: You can’t solve it by factoring and you can’t solve it with 
the quadratic formula. 


Maybe you can find ~ R without invoking DeMorgan’s laws. That is good; 
you have internalized DeMorgan’s laws and are using them unconsciously. 


Example 2.11 We will negate the following sentence. 
R: The numbers x and y are both odd. 
This statement means (x is odd) ∧ (y is odd), so its negation is


~((x is odd) ∧ (y is odd)) = ~(x is odd) ∨ ~(y is odd)
= (x is even) ∨ (y is even).


Therefore the negation of R can be expressed in the following ways: 


~R: The number x is even or the number y is even. 
~R: At least one of x and y is even. 


Now let’s move on to a slightly different kind of problem. It’s often 
necessary to find the negations of quantified statements. For example, 
consider ~(∀x ∈ ℕ, P(x)). Reading this in words, we have the following:


It is not the case that P(x) is true for all natural numbers x.


This means P(x) is false for at least one x. In symbols, this is ∃x ∈ ℕ, ~P(x).
Thus ~(∀x ∈ ℕ, P(x)) = ∃x ∈ ℕ, ~P(x). Similarly, you can reason out that
~(∃x ∈ ℕ, P(x)) = ∀x ∈ ℕ, ~P(x). In general:


~(∀x ∈ X, P(x)) = ∃x ∈ X, ~P(x),    (2.8)
~(∃x ∈ X, P(x)) = ∀x ∈ X, ~P(x).    (2.9)

Be sure that you understand these two logical equivalences. They conform
to our everyday use of language, but they pin down the meaning in a
mathematically precise way.
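
For a finite set X, Equations (2.8) and (2.9) can be tested directly, because ∀ and ∃ then correspond to Python’s all and any. The sketch below is an illustration under that finiteness assumption; the function name negation_rules_hold and the sample predicates are our own.

```python
def negation_rules_hold(domain, predicate):
    """Check both negation rules for one predicate over a finite domain."""
    domain = list(domain)
    # Rule (2.8): not (for all x, P(x))  equals  there exists x with not P(x).
    rule_2_8 = (not all(predicate(x) for x in domain)) == \
               any(not predicate(x) for x in domain)
    # Rule (2.9): not (there exists x, P(x))  equals  for all x, not P(x).
    rule_2_9 = (not any(predicate(x) for x in domain)) == \
               all(not predicate(x) for x in domain)
    return rule_2_8 and rule_2_9

# A few hypothetical predicates on the integers -5, ..., 5.
predicates = [
    lambda x: x * x >= 0,   # true for every x
    lambda x: x > 0,        # true for some x only
    lambda x: x * x == 2,   # true for no x
]

print(all(negation_rules_hold(range(-5, 6), p) for p in predicates))
```

The three predicates were chosen so that P(x) is true for all, some, and none of the elements, which covers every way the two sides of each rule could come out.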




Example 2.12 Consider negating the following statement. 
R: The square of every real number is non-negative. 


Symbolically, R can be expressed as ∀x ∈ ℝ, x² ≥ 0, and thus its negation is
~(∀x ∈ ℝ, x² ≥ 0) = ∃x ∈ ℝ, ~(x² ≥ 0) = ∃x ∈ ℝ, x² < 0. In words, this is


~R: There exists a real number whose square is negative. 


Observe that R is true and ~ R is false. Maybe you can get ~ R immediately, 
without using Equation (2.8) as we did above. If so, that is good; if not, you 
should be there soon. 


Example 2.13 If a statement has multiple quantifiers, negating it involves
several iterations of Equations (2.8) and (2.9). Consider the following:


S : For every real number x there is a real number y for which y³ = x.


This statement asserts any real number x has a cube root y, so it’s true.
Symbolically S can be expressed as


∀x ∈ ℝ, ∃y ∈ ℝ, y³ = x.


Let’s work out the negation of this statement.


~(∀x ∈ ℝ, ∃y ∈ ℝ, y³ = x) = ∃x ∈ ℝ, ~(∃y ∈ ℝ, y³ = x)
= ∃x ∈ ℝ, ∀y ∈ ℝ, ~(y³ = x)
= ∃x ∈ ℝ, ∀y ∈ ℝ, y³ ≠ x.


Thus the negation is a (false) statement that can be written in either of the
following ways.


~S: There is a real number x such that for all real numbers y, y³ ≠ x.
~S: There is a real number x for which y³ ≠ x for all real numbers y.


In writing proofs you will occasionally have to negate a conditional 
statement P ⇒ Q. The remainder of this section describes how to do this.
To begin, look at the expression ~(P ⇒ Q), which literally says “P ⇒ Q is
false.” You know from the truth table for ⇒ that the only way that P ⇒ Q
can be false is if P is true and Q is false. Therefore


~(P ⇒ Q) = P ∧ ~Q.    (2.10)


(In fact, in Exercise 12 of Section 2.6, you used a truth table to verify that 
these two statements are indeed logically equivalent.) 


Example 2.14 Negate the following statement about a particular (i.e., 
constant) number a. 

R: If a is odd then a² is odd.

Using Equation (2.10), we get the following negation.

~R: a is odd and a² is not odd.


Example 2.15 This example is like the previous one, but the constant a 
is replaced by a variable x. We will negate the following statement. 


R: If x is odd then x² is odd.

As in Section 2.8, we interpret this as the universally quantified statement

R: ∀x ∈ ℤ, (x odd) ⇒ (x² odd).

By Equations (2.8) and (2.10), we get the following negation for R.

~(∀x ∈ ℤ, (x odd) ⇒ (x² odd)) = ∃x ∈ ℤ, ~((x odd) ⇒ (x² odd))
= ∃x ∈ ℤ, (x odd) ∧ ~(x² odd).

Translating back into words, we have 

~R: There is an odd integer x whose square is not odd. 


Notice that R is true and ~ R is false. 
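
The negation ~R in Example 2.15 is an existence claim, so one way to get a feel for it is to search for a witness. The Python sketch below is an illustration only; the helper names and the finite search range are our own assumptions, and an empty search over finitely many integers does not by itself prove that R is true. It also rechecks Equation (2.10) on the four truth-table lines.

```python
from itertools import product

def is_odd(n):
    return n % 2 == 1

def witnesses_for_negation(domain):
    """Integers x in the domain with x odd and x squared not odd."""
    return [x for x in domain if is_odd(x) and not is_odd(x * x)]

def implies(p, q):
    """Truth value of P => Q."""
    return (not p) or q

# Equation (2.10): not (P => Q) has the same truth table as P and not Q.
rule_2_10 = all((not implies(p, q)) == (p and not q)
                for p, q in product([True, False], repeat=2))

print(witnesses_for_negation(range(-100, 101)))
print(rule_2_10)
```

No witness turns up in the range searched, which is consistent with R being true and ~R being false.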

The above Example 2.15 showed how to negate a conditional statement 
P(x) ⇒ Q(x). This type of problem can sometimes be embedded in more
complex negation. See Exercise 5 below (and its solution). 


Exercises for Section 2.10 

Negate the following sentences. 

1. The number x is positive, but the number y is not positive.

2. If x is prime, then √x is not a rational number.

3. For every prime number p, there is another prime number q with q > p.

4. For every positive number ε, there is a positive number δ such that |x − a| < δ
implies |f(x) − f(a)| < ε.


5. For every positive number ε, there is a positive number M for which |f(x) − b| < ε
whenever x > M. 


6. There exists a real number a for which a+ x =x for every real number x. 


7. I don’t eat anything that has a face. 




8. If x is a rational number and x ≠ 0, then tan(x) is not a rational number.

9. If sin(x) < 0, then it is not the case that 0 ≤ x ≤ π.
10. If f is a polynomial and its degree is greater than 2, then f’ is not constant. 
11. You can fool all of the people all of the time. 


12. Whenever I have to choose between two evils, I choose the one I haven’t tried 
yet. (Mae West) 


2.11 Logical Inference 


Suppose we know that a conditional statement P ⇒ Q is true. This tells us
that whenever P is true, Q will also be true. By itself, P ⇒ Q being true does
not tell us that either P or Q is true (they could both be false, or P could
be false and Q true). But if in addition we happen to know that P is true,
then Q must be true. This is called a logical inference: From two true
statements we infer that a third statement is true. In essence, statements
P ⇒ Q and P are “added together” to get Q. We can indicate this by stacking
P ⇒ Q and P one atop the other with a line separating them from Q. The
intended meaning is that P ⇒ Q combined with P produces Q.


P ⇒ Q
P
-----
Q
This is a very frequently-used pattern of thought. (In fact, it is exactly the 
pattern we used in the example on page 34.) This rule even has a name. It 
is called the modus ponens rule. 
Two other logical inferences, called modus tollens and elimination
are listed below. In each case you should convince yourself (based on your
knowledge of the relevant truth tables) that the truth of the statements
above the line forces the statement below the line to be true.


MODUS PONENS     MODUS TOLLENS     ELIMINATION
P ⇒ Q            P ⇒ Q             P ∨ Q
P                ~Q                ~P
-----            -----             -----
Q                ~P                Q


It is important to internalize these rules. (You already use at least modus 
ponens and elimination in daily life anyway.) But you need not remember 
their names; few mathematicians can recall the names, though they use 
the rules constantly. The names are not important, but the rules are. 
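
Convincing yourself that an inference is sound amounts to checking every truth-table line on which all statements above the line are true. The Python sketch below automates that check; it is an illustration, and the name valid together with the lambda encodings of the premises are our own choices. For contrast it also tests a pattern that is not a valid inference: from P ⇒ Q and Q, conclude P.

```python
from itertools import product

def implies(p, q):
    """Truth value of P => Q."""
    return (not p) or q

def valid(premises, conclusion, num_vars=2):
    """True if the conclusion holds on every row where all premises hold."""
    return all(conclusion(*row)
               for row in product([True, False], repeat=num_vars)
               if all(premise(*row) for premise in premises))

modus_ponens = valid([implies, lambda p, q: p], lambda p, q: q)
modus_tollens = valid([implies, lambda p, q: not q], lambda p, q: not p)
elimination = valid([lambda p, q: p or q, lambda p, q: not p], lambda p, q: q)

# Not a valid inference: from P => Q and Q, conclude P.
from_converse = valid([implies, lambda p, q: q], lambda p, q: p)

print(modus_ponens, modus_tollens, elimination, from_converse)
```

The last pattern fails on the line where P is false and Q is true: both premises hold there, but the conclusion does not.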




Three additional logical inferences are listed below. The first states the 
obvious fact that if P and Q are both true, then so is the statement P ∧ Q.
On the other hand, P ∧ Q being true forces P (also Q) to be true. Finally, if
P is true, then P ∨ Q must be true, no matter what statement Q is.


P
Q                P ∧ Q             P
-----            -----             -----
P ∧ Q            P                 P ∨ Q


These inferences are so intuitively obvious that they scarcely need to be 
mentioned. However, they represent certain patterns of reasoning that we 
will frequently apply to sentences in proofs, so we should be cognizant of 
the fact that we are using them. 


2.12 An Important Note 


It is important to be aware of the reasons that we study logic. There are 
three very significant reasons. First, the truth tables we studied tell us the 
exact meanings of the words such as “and,” “or,” “not” and so on. For instance, 
whenever we use or read the “If..., then” construction in a mathematical 
context, logic tells us exactly what is meant. Second, the rules of inference 
provide a system in which we can produce new information (statements) 
from known information. Finally, logical rules such as DeMorgan’s laws 
help us correctly change certain statements into (potentially more useful) 
statements with the same meaning. Thus, logic helps us understand the 
meanings of statements, and it also produces new meaningful statements. 

Logic is the glue that holds strings of statements together and pins down 
the exact meaning of certain key phrases such as the “If..., then” or “For
all” constructions. Logic is the common language that all mathematicians 
use, so we must have a firm grip on it in order to write and understand 
mathematics. 

But despite its fundamental role, logic’s place is in the background of 
what we do, not the forefront. From here on, the beautiful symbols $\wedge$, $\vee$,
$\Rightarrow$, $\Leftrightarrow$, $\sim$, $\forall$ and $\exists$ are rarely written. But we are aware of their meanings
constantly. When reading or writing a sentence involving mathematics we 
parse it with these symbols, either mentally or on scratch paper, so as to 
understand the true and unambiguous meaning. 




CHAPTER 3 


Counting 


It may seem peculiar that a college-level text has a chapter on counting.
At its most basic level, counting is a process of pointing to each object 
in a collection and calling off “one, two, three,...” until the quantity of 
objects is determined. How complex could that be? Actually, counting 
can become quite subtle, and in this chapter we explore some of its more 
sophisticated aspects. Our goal is still to answer the question “How many?” 
but we introduce mathematical techniques that bypass the actual process of 
counting individual objects. Sets play a big role in our discussions because 
the things we need to count are often naturally grouped together into a set. 
The concept of a list is also extremely useful. 


3.1 Lists 


A list is an ordered sequence of objects. A list is denoted by an opening 
parenthesis, followed by the objects, separated by commas, followed by a 
closing parenthesis. For example (a,b,c,d,e) is a list consisting of the first 
five letters of the English alphabet, in order. The objects a,b,c,d,e are called 
the entries of the list; the first entry is a, the second is b, and so on. If the 
entries are rearranged we get a different list, so, for instance, 


$(a,b,c,d,e) \neq (b,a,c,d,e)$.


A list is somewhat like a set, but instead of being a mere collection of objects, 
the entries of a list have a definite order. For sets we have 


{a,b,c,d,e} = {b,a,c,d,e}, 


but—as noted above—the analogous equality for lists does not hold. 
Unlike sets, lists can have repeated entries. Thus (5,3,5,4,3,3) is a 

perfectly acceptable list, as is (S,O,S). The length of a list is its number of 

entries. So (5,3,5,4,3,3) has length six, and (S,O,S) has length three. 




For more examples, (a, 15) is a list of length two. And (0,(0,1,1)) is a list 
of length two whose second entry is a list of length three. Two lists are 
equal if they have exactly the same entries in exactly the same positions. 
Thus equal lists have the same number of entries. If two lists have different 
lengths, then they cannot be equal. Thus $(0,0,0,0,0,0) \neq (0,0,0,0,0)$. Also


$(g,r,o,c,e,r,y,l,i,s,t) \neq$ ( [image of a paper grocery list] )


because the list on the left has length eleven but the list on the right has 
just one entry (a piece of paper with some words on it). 

There is one very special list which has no entries at all. It is called the 
empty list and is denoted (). It is the only list whose length is zero. 

For brevity we often write lists without parentheses, or even commas. 
For instance, we may write (S,O,S) as SOS if there is no risk of confusion. 
But be alert that doing this can lead to ambiguity: writing (9,10,11) as 
91011 may cause us to confuse it with (9,1,0,1,1). Here it’s best to retain 
the parenthesis/comma notation or at least write the list as 9,10,11. A list 
of symbols written without parentheses and commas is called a string. 
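
The properties of lists described so far can be tried out directly. The sketch below assumes Python tuples as a stand-in for lists and frozensets as a stand-in for sets; that modeling choice is ours, made only for illustration.

```python
# Python tuples behave like the lists described here; frozensets behave like sets.
list_1 = ("a", "b", "c", "d", "e")
list_2 = ("b", "a", "c", "d", "e")

order_matters_for_lists = list_1 != list_2   # rearranged entries give a different list
order_ignored_for_sets = frozenset(list_1) == frozenset(list_2)

repeated = (5, 3, 5, 4, 3, 3)   # repeated entries are allowed in a list
nested = (0, (0, 1, 1))         # length two; the second entry is itself a list
empty = ()                      # the empty list, the only list of length zero

lengths = (len(repeated), len(("S", "O", "S")), len(nested), len(empty))

# Dropping parentheses and commas can be ambiguous: both lists give the string "91011".
as_string_1 = "".join(str(x) for x in (9, 10, 11))
as_string_2 = "".join(str(x) for x in (9, 1, 0, 1, 1))

print(order_matters_for_lists, order_ignored_for_sets, lengths, as_string_1 == as_string_2)
```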

The process of tossing a coin ten times may be described by a string such 
as HHTHTTTHHT. Tossing it twice could lead to any of the outcomes HH, 
HT, TH or TT. Tossing it zero times is described by the empty list (). 

Imagine rolling a dice five times and recording the outcomes. This might
be described by the list (⚂,⚄,⚂,⚀,⚅), meaning that you rolled ⚂ first, then
⚄, then ⚂, etc. We might abbreviate this list as ⚂⚄⚂⚀⚅, or 3,5,3,1,6.

Now imagine rolling a pair of dice, one white and one black. A typical
outcome might be modeled as a set of two faces, one showing on the white
dice and one on the black dice. Rolling the pair six times might be
described with a list of six such outcomes:


[Figure: a list of six sets, each containing one white dice face and one black dice face]


We might abbreviate this list by writing the six pairs of faces side by side, separated by commas.

We study lists because many real-world phenomena can be described 
and understood in terms of them. Your phone number can be identified 
as a list of ten digits. (Order is essential, for rearranging the digits can 
produce a different phone number.) A byte is another important example 
of a list. A byte is simply a length-eight list of 0’s and 1’s. The world of 
information technology revolves around bytes. And the examples above 
show that multi-step processes (such as rolling a pair of dice six times) can 
be modeled as lists. 

We now explore methods of counting or enumerating lists and processes. 




3.2 The Multiplication Principle 


Many practical problems involve counting the number of possible lists that 
satisfy some condition or property. 

For example, suppose we make a list of length three having the property 
that the first entry must be an element of the set {a,b,c}, the second entry 
must be in {5,7} and the third entry must be in {a,x}. Thus (a,5,a) and 
(b,5,a) are two such lists. How many such lists are there all together? To 
answer this question, imagine making the list by selecting the first entry, 
then the second and finally the third. This is described in Figure 3.1. The 
choices for the first list entry are a,b or c, and the left of the diagram 
branches out in three directions, one for each choice. Once this choice is 
made there are two choices (5 or 7) for the second entry, and this is described 
graphically by two branches from each of the three choices for the first entry. 
This pattern continues for the choice for the third entry, which is either 
a or x. Thus, in the diagram there are $3 \cdot 2 \cdot 2 = 12$ paths from left to right,
each corresponding to a particular choice for each entry in the list. The 
corresponding lists are tallied at the far-right end of each path. So, to 
answer our original question, there are 12 possible lists with the stated 
properties. 


first choice   second choice   third choice   Resulting list

a              5               a              (a,5,a)
                               x              (a,5,x)
               7               a              (a,7,a)
                               x              (a,7,x)
b              5               a              (b,5,a)
                               x              (b,5,x)
               7               a              (b,7,a)
                               x              (b,7,x)
c              5               a              (c,5,a)
                               x              (c,5,x)
               7               a              (c,7,a)
                               x              (c,7,x)


Figure 3.1. Constructing lists of length 3


In the above example there are 3 choices for the first entry, 2 choices for 
the second entry, and 2 for the third, and the total number of possible lists 
is the product of choices $3 \cdot 2 \cdot 2 = 12$. This kind of reasoning is an instance of
what we will call the multiplication principle. We will do one more example 
before stating this important idea. 
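
The twelve paths of Figure 3.1 can also be generated by machine. The sketch below is one possible way to do it, using Python's `itertools.product`; the variable names are illustrative.

```python
from itertools import product

# Choices for each entry, taken from the example in the text.
first_choices = ["a", "b", "c"]
second_choices = [5, 7]
third_choices = ["a", "x"]

# Each path through the tree is one element of the Cartesian product.
lists = list(product(first_choices, second_choices, third_choices))

for entry in lists:
    print(entry)
print(len(lists))  # 3 * 2 * 2
```

The lists come out in the same order as the paths in the figure, read from top to bottom.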




Consider making a list of length 4 from the four letters {a,b,c,d}, where 
the list is not allowed to have a repeated letter. For example, abcd and cadb 
are allowed, but aabc and cacb are not allowed. How many such lists are 
there? 

Let’s analyze this question with a tree representing the choices we have 
for each list entry. In making such a list we could start with the first entry: 
we have 4 choices for it, namely a,b,c or d, and the left side of the tree 
branches out to each of these choices. But once we’ve chosen a letter for 
the first entry, we can’t use that letter in the list again, so there are only 3 
choices for the second entry. And once we’ve chosen letters for the first and 
second entries we can’t use these letters in the third entry, so there are just 
2 choices for it. By the time we get to the fourth entry we are forced to use 
whatever letter we have left; there is only 1 choice. 

The situation is described fully in the tree below, which shows how to make
all allowable lists by choosing one of 4 letters for the first entry, one of 3
for the second entry, one of 2 for the third entry and the 1 remaining letter
for the fourth entry. We see that the total number of lists is the product
$4 \cdot 3 \cdot 2 \cdot 1 = 24$.


[Tree diagram with columns: 1st choice, 2nd choice, 3rd choice, 4th choice, Resulting list]

Resulting lists:

abcd  abdc  acbd  acdb  adbc  adcb
bacd  badc  bcad  bcda  bdac  bdca
cabd  cadb  cbad  cbda  cdab  cdba
dabc  dacb  dbac  dbca  dcab  dcba


Figure 3.2. Constructing lists from letters in {a,b,c,d}, without repetition.
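
As a check on the tree, the 24 lists can be enumerated directly. This is a small sketch using `itertools.permutations`, which produces every ordering of its input with no entry reused; the variable names are ours.

```python
from itertools import permutations

letters = "abcd"

# permutations() yields every ordering of the four letters, with no letter reused.
lists = ["".join(p) for p in permutations(letters)]

print(lists)
print(len(lists))  # 4 * 3 * 2 * 1
```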




These trees show that the number of lists constructible by some specified 
process equals the product of the numbers of choices for each list entry. We 
summarize this kind of reasoning as an important fact. 


Fact 3.1 (Multiplication Principle) Suppose in making a list of
length $n$ there are $a_1$ possible choices for the first entry, $a_2$ possible
choices for the second entry, $a_3$ possible choices for the third entry, and
so on. Then the total number of different lists that can be made this way
is the product $a_1 \cdot a_2 \cdot a_3 \cdots a_n$.


In using the multiplication principle you do not need to draw a tree
with $a_1 \cdot a_2 \cdots a_n$ branches. Just multiply the numbers!


Example 3.1 A standard license plate consists of three letters followed 
by four digits. For example, JRB-4412 and MMX-8901 are two standard 
license plates. How many different standard license plates are possible? 


Solution: A license plate such as JRB-4412 corresponds to a length-7 list 
(J,R,B,4,4,1,2), so we just need to count how many such lists are possible. 
We use the multiplication principle. There are $a_1 = 26$ possibilities (one for
each letter of the alphabet) for the first entry of the list. Similarly, there are
$a_2 = 26$ possibilities for the second entry and $a_3 = 26$ possibilities for the third.
There are $a_4 = 10$ possibilities for the fourth entry. Likewise $a_5 = a_6 = a_7 = 10$.
So there is a total of $a_1 \cdot a_2 \cdot a_3 \cdot a_4 \cdot a_5 \cdot a_6 \cdot a_7 = 26 \cdot 26 \cdot 26 \cdot 10 \cdot 10 \cdot 10 \cdot 10 = 175{,}760{,}000$
possible standard license plates.
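
"Just multiply the numbers" translates into a single call. The sketch below assumes Python 3.8 or later, where `math.prod` is available; the list name `choices_per_entry` is an illustrative choice.

```python
from math import prod

# Number of choices for each of the seven entries of a plate such as JRB-4412:
# three letters (26 choices each) followed by four digits (10 choices each).
choices_per_entry = [26, 26, 26, 10, 10, 10, 10]

total_plates = prod(choices_per_entry)
print(total_plates)  # 175760000
```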


Example 3.2 In ordering a café latte, you have a choice of whole, skim or 
soy milk; small, medium or large; and either one or two shots of espresso. 
How many choices do you have in ordering one drink? 


Solution: Your choice is modeled by a list of form (milk, size, shots). There 
are 3 choices for the first entry, 3 for the second and 2 for the third. By the 
multiplication principle, the number of choices is $3 \cdot 3 \cdot 2 = 18$.


There are two types of list-counting problems. On one hand, there are 
situations in which list entries can be repeated, as in license plates or 
telephone numbers. The sequence CCX-4144 is a perfectly valid license 
plate in which the symbols C and 4 appear more than once. On the other 
hand, for some lists repeated symbols do not make sense or are not allowed, 
as in the (milk, size, shots) list from Example 3.2. We say repetition is 
allowed in the first type of list and repetition is not allowed in the second 
kind of list. (We will call a list in which repetition is not allowed a non- 
repetitive list.) The next example illustrates the difference. 




Example 3.3 Consider lists of length 4 made with symbols A,B,C,D,E,F,G. 
(a) How many such lists are possible if repetition is allowed? 
(b) How many such lists are possible if repetition is not allowed? 
(c) How many are there if repetition is not allowed and the list has an E? 
(d) How many are there if repetition is allowed and the list has an E? 


Solutions: 


(a) Imagine the list as containing four boxes that we fill with selections 
from the letters A, B, C, D, E, F and G, as illustrated below. 


(□, □, □, □)
 7  7  7  7   choices for each box


We have 7 choices in filling each box. The multiplication principle says
the total number of lists that can be made this way is $7 \cdot 7 \cdot 7 \cdot 7 = 2401$.


(b) This problem is the same as the previous one except that repetition 
is not allowed. We have seven choices for the first box, but once it is 
filled we can no longer use the symbol that was placed in it. Hence 
there are only six possibilities for the second box. Once the second 
box has been filled we have used up two of our letters, and there are 
only five left to choose from in filling the third box. Finally, when the 
third box is filled we have only four possible letters for the last box. 


(□, □, □, □)
 7  6  5  4   choices for each box


Thus there are $7 \cdot 6 \cdot 5 \cdot 4 = 840$ lists in which repetition does not occur.


(c) We are asked to count the length-4 lists in which repetition is not 
allowed and the symbol E must appear somewhere in the list. Thus E
occurs once and only once in each list. Let us divide these lists into 
four categories depending on whether the E occurs as the first, second, 
third or fourth entry. These four types of lists are illustrated below. 


Type 1          Type 2          Type 3          Type 4
(E, □, □, □)    (□, E, □, □)    (□, □, E, □)    (□, □, □, E)
    6  5  4      6     5  4      6  5     4      6  5  4   choices




Consider lists of the first type, in which the E appears in the first 
entry. We have six remaining choices (A, B, C, D, F or G) for the second 
entry, five choices for the third entry and four choices for the fourth 
entry. Hence there are $6 \cdot 5 \cdot 4 = 120$ lists having an E in the first entry.
As shown above, the same count of $6 \cdot 5 \cdot 4 = 120$ applies to the lists having
the E in the second entry, to those having it in the third, and to those
having it in the fourth. So there are $120 + 120 + 120 + 120 = 480$
lists with exactly one E.


(d) Now we seek the number of length-4 lists where repetition is allowed 
and the list must contain an E. Here is our strategy: By Part (a) of
this example there are $7 \cdot 7 \cdot 7 \cdot 7 = 7^4 = 2401$ lists with repetition allowed.
Obviously this is not the answer to our current question, for many of
these lists contain no E. We will subtract from 2401 the number of
lists that do not contain an E. In making a list that does not contain
an E, we have six choices for each list entry (because we can choose
any one of the six letters A, B, C, D, F or G). Thus there are
$6 \cdot 6 \cdot 6 \cdot 6 = 6^4 = 1296$ lists without an E. So the answer to our question is that
there are $2401 - 1296 = 1105$ lists with repetition allowed that contain
at least one E. 
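
All four answers in this example are small enough to confirm by brute force. The following sketch simply enumerates the lists and counts those with the required property; it is a check on the reasoning above, not a replacement for it, and the variable names are ours.

```python
from itertools import permutations, product

symbols = "ABCDEFG"

with_repetition = list(product(symbols, repeat=4))   # part (a)
non_repetitive = list(permutations(symbols, 4))      # part (b)

count_a = len(with_repetition)
count_b = len(non_repetitive)
count_c = sum(1 for lst in non_repetitive if "E" in lst)    # part (c)
count_d = sum(1 for lst in with_repetition if "E" in lst)   # part (d)

print(count_a, count_b, count_c, count_d)
```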


Before moving on from Example 3.3, let’s address an important point. 
Perhaps you wondered if Part (d) could be solved in the same way as Part (c). 
Let’s try doing it that way. We want to count the length-4 lists (repetition 
allowed) that contain at least one E. The following diagram is adapted from 
Part (c). The only difference is that there are now seven choices in each slot 
because we are allowed to repeat any of the seven letters. 


Type 1          Type 2          Type 3          Type 4
(E, □, □, □)    (□, E, □, □)    (□, □, E, □)    (□, □, □, E)
    7  7  7      7     7  7      7  7     7      7  7  7   choices


We get a total of $7^3 + 7^3 + 7^3 + 7^3 = 1372$ lists, an answer that is larger
than the (correct) value of 1105 from our solution to Part (d) above. It is
easy to see what went wrong. The list (E,E,A,B) is of type 1 and type 2,
so it got counted twice. Similarly (E,E,C,E) is of type 1, 2 and 4, so it got 
counted three times. In fact, you can find many similar lists that were 
counted multiple times. In solving counting problems, we must always be 
careful to avoid this kind of double-counting or triple-counting, or worse. 
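
The over-count can be observed directly. In the sketch below (with illustrative names), `types[i]` holds the lists having an E in position i; adding the sizes of these four overlapping sets gives the inflated total, while taking their union counts each list once.

```python
from itertools import product

symbols = "ABCDEFG"

# types[i] is the set of length-4 lists (repetition allowed) with an E in position i.
types = [
    {lst for lst in product(symbols, repeat=4) if lst[i] == "E"}
    for i in range(4)
]

naive_total = sum(len(t) for t in types)   # adds the sizes of overlapping sets
correct_total = len(set().union(*types))   # counts each list exactly once

def times_counted(lst):
    """Number of the four types that contain the given list."""
    return sum(lst in t for t in types)

print(naive_total, correct_total)
print(times_counted(tuple("EEAB")), times_counted(tuple("EECE")))
```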




The next section presents two new counting principles that codify the 
kind of thinking we used in parts (c) and (d) above. Combined with the 
multiplication principle, they solve complex counting problems in ways that 
avoid the pitfalls of double counting. But first, one more example of the 
multiplication principle highlights another pitfall to be alert to. 


Example 3.4 A non-repetitive list of length 5 is to be made from the 
symbols A, B, C, D, E, F, G. The first entry must be either a B, C or D, and 
the last entry must be a vowel. How many such lists are possible? 


Solution: Start by making a list of five boxes. The first box must contain 
either B, C or D, so there are three choices for it. 


(□, □, □, □, □)
 3               choices


Now there are 6 letters left for the remaining 4 boxes. The knee-jerk action 
is to fill them in, one at a time, using up an additional letter each time. 


(□, □, □, □, □)
 3  6  5  4      choices


But when we get to the last box, there is a problem. It is supposed to 
contain a vowel, but for all we know we have already used up one or both 
vowels in the previous boxes. The multiplication principle breaks down 
because there is no way to tell how many choices there are for the last box. 

The correct way to solve this problem is to fill in the first and last boxes 
(the ones that have restrictions) first. 


(□, □, □, □, □)
 3           2   choices


Then fill the remaining middle boxes with the 5 remaining letters. 


(□, □, □, □, □)
 3  5  4  3  2   choices


By the multiplication principle, there are $3 \cdot 5 \cdot 4 \cdot 3 \cdot 2 = 360$ lists.
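
The answer can be confirmed by listing every non-repetitive list of length 5 and keeping those that meet both restrictions. This is only a brute-force check under the assumption that the vowels among the seven symbols are A and E; the variable names are ours.

```python
from itertools import permutations

symbols = "ABCDEFG"
vowels = "AE"

# Enumerate every non-repetitive list of length 5 and keep those meeting both restrictions.
valid = [
    lst for lst in permutations(symbols, 5)
    if lst[0] in "BCD" and lst[-1] in vowels
]

# Restricted boxes first (3 and 2 choices), then the middle boxes (5, 4, 3 choices).
by_multiplication = 3 * 2 * 5 * 4 * 3

print(len(valid), by_multiplication)
```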


The new principles to be introduced in the next section are usually used 
in conjunction with the multiplication principle. So work a few exercises 
now to test your understanding of it. 




Exercises for Section 3.2 


1. Consider lists made from the letters T, H, E, O, R, Y, with repetition allowed.
   (a) How many length-4 lists are there?
   (b) How many length-4 lists are there that begin with T?
   (c) How many length-4 lists are there that do not begin with T?

2. Airports are identified with 3-letter codes. For example, Richmond, Virginia has
   the code RIC, and Memphis, Tennessee has MEM. How many different 3-letter
   codes are possible?

3. How many lists of length 3 can be made from the symbols A, B, C, D, E, F if...
   (a) ... repetition is allowed.
   (b) ... repetition is not allowed.
   (c) ... repetition is not allowed and the list must contain the letter A.
   (d) ... repetition is allowed and the list must contain the letter A.

4. In ordering coffee you have a choice of regular or decaf; small, medium or large;
   here or to go. How many different ways are there to order a coffee?

5. This problem involves 8-digit binary strings such as 10011011 or 00001010 (i.e.,
   8-digit numbers composed of 0’s and 1’s).
   (a) How many such strings are there?
   (b) How many such strings end in 0?
   (c) How many such strings have 1’s for their second and fourth digits?
   (d) How many such strings have 1’s for their second or fourth digits?

6. You toss a coin, then roll a dice, and then draw a card from a 52-card deck. How
   many different outcomes are there? How many outcomes are there in which the
   dice lands on [dice face image]? How many outcomes are there in which the dice lands on an
   odd number? How many outcomes are there in which the dice lands on an odd
   number and the card is a King?

7. This problem concerns 4-letter codes made from the letters A, B, C, D, ..., Z.
   (a) How many such codes can be made?
   (b) How many such codes have no two consecutive letters the same?

8. A coin is tossed 10 times in a row. How many possible sequences of heads and
   tails are there?

9. A new car comes in a choice of five colors, three engine sizes and two
   transmissions. How many different combinations are there?

10. A dice is tossed four times in a row. There are many possible outcomes, such as
    [two example sequences of four dice faces]. How many different outcomes are possible?




3.3 The Addition and Subtraction Principles 


We now discuss two new counting principles, the addition and subtraction 
principles. Actually, they are not entirely new—you’ve used them intuitively 
for years. Here we give names to these two fundamental thought patterns, 
and phrase them in the language of sets. Doing this helps us recognize when 
we are using them, and, more importantly, it helps us see new situations in 
which they can be used. 

The addition principle simply asserts that if a set can be broken into 
pieces, then the size of the set is the sum of the sizes of the pieces. 


Fact 3.2 (Addition Principle)
Suppose a finite set $X$ can be decomposed as a union $X = X_1 \cup X_2 \cup \cdots \cup X_n$,
where $X_i \cap X_j = \emptyset$ whenever $i \neq j$. Then $|X| = |X_1| + |X_2| + \cdots + |X_n|$.




In our first example we will rework an instance where we used the 
addition principle naturally, without comment: in Part (c) of Example 3.3. 


Example 3.5 How many length-4 non-repetitive lists can be made from 
the symbols A, B, C, D, E, F, G, if the list must contain an E? 


In Example 3.3 (c) our approach was to divide these lists into four types, 
depending on whether the E is in the first, second, third or fourth position. 
Type 1          Type 2          Type 3          Type 4
(E, □, □, □)    (□, E, □, □)    (□, □, E, □)    (□, □, □, E)
    6  5  4      6     5  4      6  5     4      6  5  4


Then we used the multiplication principle to count the lists of type 1. 
There are 6 choices for the second entry, 5 for the third, and 4 for the fourth. 
This is indicated above, where the number below a box is the number of 
choices we have for that position. The multiplication principle implies that 
there are $6 \cdot 5 \cdot 4 = 120$ lists of type 1. Similarly there are $6 \cdot 5 \cdot 4 = 120$ lists of
each of types 2, 3 and 4.


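[Figure: the set $X$ divided into four parts, with sample elements of each part]

$X_1$: EABC, EACB, EBAC, ...
$X_2$: AEBC, AECB, BEAC, ...
$X_3$: ABEC, ACEB, BAEC, ...
$X_4$: ABCE, ACBE, BACE, ...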




We then used the addition principle intuitively, conceiving of the lists to be 
counted as the elements of a set $X$, broken up into parts $X_1$, $X_2$, $X_3$ and $X_4$,
which are the lists of types 1, 2, 3 and 4, respectively.

The addition principle says that the number of lists that contain an E is
$|X| = |X_1| + |X_2| + |X_3| + |X_4| = 120 + 120 + 120 + 120 = 480$.


We use the addition principle when we need to count the things in some 
set $X$. If we can find a way to break $X$ up as $X = X_1 \cup X_2 \cup \cdots \cup X_n$, where each
$X_i$ is easier to count than $X$, then the addition principle gives an answer of
$|X| = |X_1| + |X_2| + |X_3| + \cdots + |X_n|$.

But for this to work the intersection of any two distinct pieces $X_i$ and $X_j$ must be $\emptyset$,
as stated in Fact 3.2. For instance, if $X_1$ and $X_2$ shared an element, then
that element would be counted once in $|X_1|$ and again in $|X_2|$, and we’d
get $|X| < |X_1| + |X_2| + \cdots + |X_n|$. (This is precisely the double counting issue
mentioned after Example 3.3.)


Example 3.6 How many even 5-digit numbers are there for which no 
digit is 0, and the digit 6 appears exactly once? For instance, 55634 and 
16118 are such numbers, but not 63304 (has a 0), nor 63364 (too many 6’s), 
nor 55637 (not even). 


Solution: Let X be the set of all such numbers. The answer will be |X|, so 
our task is to find $|X|$. Put $X = X_1 \cup X_2 \cup X_3 \cup X_4 \cup X_5$, where $X_i$ is the set of
those numbers in $X$ whose $i$th digit is 6, as diagramed below. Note $X_i \cap X_j = \emptyset$
whenever $i \neq j$ because the numbers in $X_i$ have their 6 in a different position
than the numbers in $X_j$. Our plan is to use the multiplication principle to
compute each $|X_i|$, and follow this with the addition principle.


$X_1$        $X_2$        $X_3$        $X_4$        $X_5$
6 □ □ □ □    □ 6 □ □ □    □ □ 6 □ □    □ □ □ 6 □    □ □ □ □ 6
  8 8 8 3    8   8 8 3    8 8   8 3    8 8 8   3    8 8 8 8


The first digit of any number in $X_1$ is 6, and the three digits following it
can be any of the ten digits except 0 (not allowed) or 6 (already appears).
Thus there are eight choices for each of the three digits following the first 6.
But because any number in $X_1$ is even, its final digit must be one of 2, 4
or 8, so there are just three choices for this final digit. By the multiplication
principle, $|X_1| = 8 \cdot 8 \cdot 8 \cdot 3 = 1536$. Likewise $|X_2| = |X_3| = |X_4| = 8 \cdot 8 \cdot 8 \cdot 3 = 1536$.

But $X_5$ is slightly different because we do not choose the final digit,
which is already 6. The multiplication principle gives $|X_5| = 8 \cdot 8 \cdot 8 \cdot 8 = 4096$.

The addition principle gives our final answer. The number of even 5-digit
numbers with no 0’s and one 6 is $|X| = |X_1| + |X_2| + |X_3| + |X_4| + |X_5| = 1536 + 1536 + 1536 + 1536 + 4096 = 10{,}240$.
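
Since there are only 90,000 five-digit numbers, the decomposition can be checked by direct search. In the sketch below the list `parts` plays the role of the pieces $X_1, \ldots, X_5$; the names and the string-based digit tests are illustrative choices.

```python
# X: even 5-digit numbers with no digit 0 and exactly one digit 6.
X = [
    n for n in range(10000, 100000)
    if n % 2 == 0 and "0" not in str(n) and str(n).count("6") == 1
]

# parts[i] plays the role of X_(i+1): the numbers whose 6 is in position i (0-based).
parts = [[n for n in X if str(n)[i] == "6"] for i in range(5)]
sizes = [len(p) for p in parts]

print(sizes)               # sizes of X_1, ..., X_5
print(sum(sizes), len(X))  # the addition principle: these two agree
```

Because each number in X has exactly one 6, no number lands in two of the pieces, which is why the sizes add up to the size of X.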




Now we introduce our next counting method, the subtraction principle. To set
it up, imagine that a set $X$ is a subset of a universal set $U$, as shown in the
figure. The complement $\overline{X} = U - X$ is shaded.

[Figure: a set $X$ inside a universal set $U$, with the complement $\overline{X}$ shaded]

Suppose we wanted to count the things in this shaded region. Surely this is
the number of things in $U$ minus the number of things in $X$, which is to
say $|U - X| = |U| - |X|$. That is the subtraction principle.


Fact 3.3 (Subtraction Principle)
If $X$ is a subset of a finite set $U$, then $|\overline{X}| = |U| - |X|$.
In other words, if $X \subseteq U$ then $|U - X| = |U| - |X|$.


The subtraction principle is used in situations where it is easier to count 
the things in some set U that we wish to exclude from consideration than 
it is to count those things that are included. We have seen this kind of 
thinking before. We quietly and naturally used it in part (d) of Example 3.3. 
For convenience we repeat that example now, casting it into the language 
of the subtraction principle. 


Example 3.7 How many length-4 lists can be made from the symbols 
A, B, C, D, E, F, G if the list has at least one E, and repetition is allowed? 


Solution: Such a list might contain one, two, three or four E’s, which could 
occur in various positions. This is a fairly complex situation. 

But it is very easy to count the set U of all lists of length 4 made from 
A, B, C, D, E, F, G if we don’t care whether or not the lists have any E’s. 
The multiplication principle says $|U| = 7 \cdot 7 \cdot 7 \cdot 7 = 2401$.

It is equally easy to count the set $X$ of those lists that contain no E’s.
The multiplication principle says $|X| = 6 \cdot 6 \cdot 6 \cdot 6 = 1296$.

We are interested in those lists that have at least one E, and this is
the set $U - X$. By the subtraction principle, the answer to our question is
$|U - X| = |U| - |X| = 2401 - 1296 = 1105$.
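
The same computation can be phrased with actual sets. The sketch below builds U and X as Python sets (an illustrative modeling choice) and confirms that the size of the difference equals the difference of the sizes, which holds here because X is a subset of U.

```python
from itertools import product

symbols = "ABCDEFG"

U = set(product(symbols, repeat=4))        # all length-4 lists, repetition allowed
X = {lst for lst in U if "E" not in lst}   # the lists that contain no E
complement = U - X                         # the lists with at least one E

print(X <= U)                              # X is a subset of U
print(len(U), len(X), len(complement))
```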


As we continue with counting we will have many opportunities to use 
the multiplication, addition and subtraction principles. Usually these will 
arise in the context of other counting principles that we have yet to explore. 
It is thus important that you solidify the current ideas now, by working 
some exercises before moving on. 




Exercises for Section 3.3 


1. Five cards are dealt off of a standard 52-card deck and lined up in a row. How
   many such lineups are there that have at least one red card? How many such
   lineups are there in which the cards are either all black or all hearts?

2. Five cards are dealt off of a standard 52-card deck and lined up in a row. How
   many such lineups are there in which all 5 cards are of the same suit?

3. Five cards are dealt off of a standard 52-card deck and lined up in a row. How
   many such lineups are there in which all 5 cards are of the same color (i.e., all
   black or all red)?

4. Five cards are dealt off of a standard 52-card deck and lined up in a row. How
   many such lineups are there in which exactly one of the 5 cards is a queen?

5. How many integers between 1 and 9999 have no repeated digits? How many
   have at least one repeated digit?

6. Consider lists made from the symbols A, B, C, D, E, with repetition allowed.
   (a) How many such length-5 lists have at least one letter repeated?
   (b) How many such length-6 lists have at least one letter repeated?

7. A password on a certain site must be five characters long, made from letters
   of the alphabet, and have at least one upper case letter. How many different
   passwords are there? What if there must be a mix of upper and lower case?

8. This problem concerns lists made from the letters A, B, C, D, E, F, G, H, I, J.
   (a) How many length-5 lists can be made from these letters if repetition is
       not allowed and the list must begin with a vowel?
   (b) How many length-5 lists can be made from these letters if repetition is
       not allowed and the list must begin and end with a vowel?
   (c) How many length-5 lists can be made from these letters if repetition is
       not allowed and the list must contain exactly one A?

9. Consider lists of length 6 made from the letters A, B, C, D, E, F, G, H. How
   many such lists are possible if repetition is not allowed and the list contains two
   consecutive vowels?

10. Consider the lists of length six made with the symbols P, R, O, F, S, where
    repetition is allowed. (For example, the following is such a list: (P,R,O,O,F,S).)
    How many such lists can be made if the list must end in an S and the symbol O
    is used more than once?

11. How many integers between 1 and 1000 are divisible by 5? How many are not
    divisible by 5?

12. Six math books, four physics books and three chemistry books are arranged on
    a shelf. How many arrangements are possible if all books of the same subject
    are grouped together?




3.4 Factorials and Permutations 


In working examples from the previous two sections you may have noticed 
that we often need to count the number of non-repetitive lists of length n 
that are made from n symbols. This kind of problem occurs so often that a 
special idea, called a factorial, is used to handle it. 

The table below motivates this. The first column lists successive integer 
values n, from 0 onward. The second contains a set {a,b,...} of n symbols. 
The third column shows all the possible non-repetitive lists of length n 
that can be made from these symbols. Finally, the last column tallies up 
how many lists there are of that type. When n = 0 there is only one list of 
length 0 that can be made from 0 symbols, namely the empty list (). Thus 
the value 1 is entered in the last column of that row. 


n | Symbols   | Non-repetitive lists of length n made from the symbols | n!
--+-----------+--------------------------------------------------------+---
0 | { }       | ( )                                                    | 1
1 | {a}       | a                                                      | 1
2 | {a,b}     | ab, ba                                                 | 2
3 | {a,b,c}   | abc, acb, bac, bca, cab, cba                           | 6
4 | {a,b,c,d} | abcd, acbd, bacd, bcad, cabd, cbad,                    | 24
  |           | abdc, acdb, badc, bcda, cadb, cbda,                    |
  |           | adbc, adcb, bdac, bdca, cdab, cdba,                    |
  |           | dabc, dacb, dbac, dbca, dcab, dcba                     |


For n > 0, the number that appears in the last column can be computed 
using the multiplication principle. The number of non-repetitive lists of 
length $n$ that can be made from $n$ symbols is $n(n-1)(n-2) \cdots 3 \cdot 2 \cdot 1$. Thus, for
instance, the number in the last column of the row for $n = 4$ is $4 \cdot 3 \cdot 2 \cdot 1 = 24$.

The number that appears in the last column of Row $n$ is called the
factorial of $n$. It is denoted with the special symbol $n!$, which we pronounce
as “n factorial.” Here is the definition:


Definition 3.1 If $n$ is a non-negative integer, then $n!$ is the number of
lists of length $n$ that can be made from $n$ symbols, without repetition.
Thus $0! = 1$ and $1! = 1$. If $n > 1$, then $n! = n(n-1)(n-2) \cdots 3 \cdot 2 \cdot 1$.




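It follows that
$0! = 1$
$1! = 1$
$2! = 2 \cdot 1 = 2$
$3! = 3 \cdot 2 \cdot 1 = 6$
$4! = 4 \cdot 3 \cdot 2 \cdot 1 = 24$
$5! = 5 \cdot 4 \cdot 3 \cdot 2 \cdot 1 = 120$
$6! = 6 \cdot 5 \cdot 4 \cdot 3 \cdot 2 \cdot 1 = 720$, and so on.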

Students are often tempted to say $0! = 0$, but this is wrong. The correct
value is $0! = 1$, as the above definition and table show. Here is another way
to see that $0!$ must equal 1: Notice that $5! = 5 \cdot 4 \cdot 3 \cdot 2 \cdot 1 = 5 \cdot (4 \cdot 3 \cdot 2 \cdot 1) = 5 \cdot 4!$.
Also $4! = 4 \cdot 3 \cdot 2 \cdot 1 = 4 \cdot (3 \cdot 2 \cdot 1) = 4 \cdot 3!$. Generalizing this, we get a formula.


$n! = n \cdot (n-1)!$    (3.1)


Plugging in $n = 1$ gives $1! = 1 \cdot (1-1)! = 1 \cdot 0!$, that is, $1! = 1 \cdot 0!$. If we mistakenly
thought $0!$ were 0, this would give the incorrect result $1! = 0$.
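
Formula (3.1), together with the starting value 0! = 1, is enough to compute any factorial. The sketch below defines a hypothetical helper `factorial` this way and compares it, for n from 0 to 6, with a direct count of non-repetitive lists and with Python's built-in `math.factorial`.

```python
from itertools import permutations
from math import factorial as library_factorial

def factorial(n):
    """Compute n! from the recursion n! = n * (n-1)!, starting from 0! = 1."""
    if n < 0:
        raise ValueError("n must be a non-negative integer")
    if n == 0:
        return 1
    return n * factorial(n - 1)

for n in range(7):
    symbols = "abcdef"[:n]
    # Count the non-repetitive lists of length n made from n symbols.
    count = sum(1 for _ in permutations(symbols))
    print(n, factorial(n), count, library_factorial(n))
```

For n = 0 the enumeration yields exactly one list, the empty list, in agreement with 0! = 1.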


Example 3.8 This problem involves making lists of length seven from the 
letters a, b, c, d, e, f and g.


(a) How many such lists are there if repetition is not allowed? 

(b) How many such lists are there if repetition is not allowed and the first 
two entries must be vowels? 

(c) How many such lists are there in which repetition is allowed, and the 
list must contain at least one repeated letter? 


To answer the first question, note that there are seven letters, so the 
number of lists is 7! = 5040. To answer the second question, notice that 
the set {a, b, c, d, e, f, g} contains two vowels and five consonants. Thus in 
making the list the first two entries must be filled by vowels and the final 
five must be filled with consonants. By the multiplication principle, the 
number of such lists is $2\cdot 1\cdot 5\cdot 4\cdot 3\cdot 2\cdot 1 = 2!\,5! = 240$.

To answer part (c) we use the subtraction principle. Let U be the set of all
lists made from a, b, c, d, e, f, g, with repetition allowed. The multiplication
principle gives $|U| = 7\cdot 7\cdot 7\cdot 7\cdot 7\cdot 7\cdot 7 = 7^7 = 823{,}543$. Notice that U includes
lists that are non-repetitive, like (a,g,f,b,d,c,e), as well as lists that have
some repetition, like (f,g,b,g,a,a,a). We want to find the number of lists that
have at least one repeated letter, so we will subtract away from U all those
lists that have no repetition. Let $X \subseteq U$ be those lists that have no repetition,
so |X| = 7!. Thus the answer to our question is $|U - X| = |U| - |X| = 7^7 - 7! =
823{,}543 - 5040 = 818{,}503$.
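The three answers in Example 3.8 are small enough to confirm by brute force. The sketch below enumerates the lists with `itertools`; the names `LETTERS`, `VOWELS` and `count_a`, `count_b`, `count_c` are illustrative, not from the text. Part (c) loops over all $7^7$ lists, so it may take a moment to run.

```python
from itertools import permutations, product
from math import factorial

LETTERS = "abcdefg"
VOWELS = set("ae")  # the only vowels among a, ..., g

# (a) non-repetitive lists of length seven
count_a = sum(1 for _ in permutations(LETTERS))

# (b) non-repetitive lists whose first two entries are vowels
count_b = sum(1 for p in permutations(LETTERS)
              if p[0] in VOWELS and p[1] in VOWELS)

# (c) lists with repetition allowed that contain at least one repeated letter
count_c = sum(1 for t in product(LETTERS, repeat=7) if len(set(t)) < 7)

print(count_a, count_b, count_c)
```

A list has a repeated letter exactly when it uses fewer than seven distinct letters, which is what `len(set(t)) < 7` tests.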




In part (a) of Example 3.8 we counted the number of non-repetitive lists 
made from all seven of the symbols in the set X = {a, b, c, d, e, f, g}, and 
there were 7! = 5040 such lists. Any such list, such as bcedagf, gfedcba or 
abcdefg is simply an arrangement of the elements of X in a row. There is a 
name for such an arrangement. It is called a permutation of X. 

A permutation of a set is an arrangement of all of the set’s elements 
in a row, that is, a list without repetition that uses every element of the set. 
For example, the permutations of the set X = {1,2,3} are the six lists 


123, 132, 213, 231, 312, 321.


That we get six different permutations of X is predicted by Definition 3.1, 
which says there are $3! = 3\cdot 2\cdot 1 = 6$ non-repetitive lists that can be made
from the three symbols in X. 

Think of the numbers 1, 2 and 3 as representing three books. The above 
shows that there are six ways to arrange them on a shelf. 

From a deck of cards you take the four queens and lay them in a row. 
By the multiplication principle there are $4! = 4\cdot 3\cdot 2\cdot 1 = 24$ ways to do this,
that is, there are 24 permutations of the set of four Queen cards. 


[Figure: the 24 arrangements of the four queens in a row.]

In general, a set with n elements will have n! different permutations.
Above, the set {1,2,3} has 3! = 6 permutations, while the set of four queens has
4! = 24 permutations. The set {a,b,c,d,e,f,g} has 7! = 5040 permutations,
though there’s not much point in listing them all out. The important thing 
is that the factorial counts the number of permutations. 
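To see the claim in action, the following sketch lists the permutations of a small set with Python's `itertools.permutations` and compares the count with n!. The helper name `all_permutations` is hypothetical.

```python
from itertools import permutations
from math import factorial

def all_permutations(elements):
    """Return every permutation of the given elements as a list of tuples."""
    return list(permutations(elements))

perms = all_permutations([1, 2, 3])
print(["".join(map(str, p)) for p in perms])
# → ['123', '132', '213', '231', '312', '321']

# A set with n elements has n! permutations.
for n in range(1, 8):
    assert len(all_permutations(range(n))) == factorial(n)
```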

In saying a permutation of a set is an arrangement of its elements in a 
row, we are speaking informally because sometimes the elements are not 
literally in a row. Imagine a classroom of 20 desks, in four rows of five desks 
each. Let X be a class (set) of 20 students. If the students walk in and seat 
themselves, one per desk, we can regard this as a permutation of the 20 
students because we can number the desks 1,2,3,...,20 and in this sense 
the students have arranged themselves in a list of length 20. There are 
20! = 2,432,902,008,176,640,000 permutations of the students.




Now we discuss a variation of the idea of a permutation of a set X. 
Imagine taking some number $k \le |X|$ of elements from the set X and then
arranging them in a row. The result is what we call a k-permutation of X. 
A permutation of X is a non-repetitive list made from all elements of X. 
A k-permutation of X is a non-repetitive list made from k elements of X. 


For example, take X = {a,b,c,d}. The 1-permutations of X are the lists 
we could make with just one element from X. There are only 4 such lists: 


a   b   c   d.


The 2-permutations of X are the non-repetitive lists that we could make 
from two elements of X. There are 12 of them: 


ab ac ad ba bc bd ca cb cd da db dc.


Even before writing them all down, we’d know there are 12 of them because 
in making a non-repetitive length-2 list from X we have 4 choices for the first 
element, then 3 choices for the second, so by the multiplication principle 
the total number of 2-permutations of X is $4\cdot 3 = 12$.

Now let’s count the number of 3-permutations of X. They are the length-3
non-repetitive lists made from elements of X. The multiplication principle
says there will be $4\cdot 3\cdot 2 = 24$ of them. Here they are:


abc acb bac bca cab cba
abd adb bad bda dab dba
acd adc cad cda dac dca
bcd bdc cbd cdb dbc dcb


The 4-permutations of X are the non-repetitive lists made from all 4 
elements of X. These are simply the $4! = 4\cdot 3\cdot 2\cdot 1 = 24$ permutations of X.

Let’s go back and think about the 0-permutations of X. They are the 
non-repetitive lists of length 0 made from the elements of X. Of course 
there is only one such list, namely the empty list (). 


Now we are going to introduce some notation. The expression P(n,k) 
denotes the number of k-permutations of an n-element set. By the examples 
on this page we have P(4,0) = 1, P(4,1) = 4, P(4,2) = 12, P(4,3) = 24, and 
P(4,4) = 24. 

What about, say, P(4,5)? This is the number of 5-permutations of a 
4-element set, that is, the number of non-repetitive length-5 lists that can 
be made from 4 symbols. There is no such list, so P(4,5) = 0. 
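The values P(4,0) through P(4,5) can be reproduced by listing the k-permutations directly. This is a small sketch using `itertools.permutations`, whose optional second argument is the list length; the helper name `k_permutations` is an assumption made for illustration.

```python
from itertools import permutations

X = ["a", "b", "c", "d"]

def k_permutations(elements, k):
    """List the non-repetitive length-k lists made from `elements`."""
    return ["".join(p) for p in permutations(elements, k)]

for k in range(6):
    print(k, len(k_permutations(X, k)))

print(k_permutations(X, 2))
```

For k = 0 the enumeration yields exactly one list, the empty one, and for k = 5 it yields none, matching P(4,0) = 1 and P(4,5) = 0.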




If n > 0, then P(n,k) can be computed with the multiplication principle.
In making a non-repetitive length-k list from n symbols we have n choices
for the 1st entry, $n-1$ for the 2nd, $n-2$ for the 3rd, and $n-3$ for the 4th.


    entry:     1st    2nd    3rd    4th    5th    ...    kth
    choices:    n     n-1    n-2    n-3    n-4    ...    n-k+1


Notice that the number of choices for the ith position is $n-i+1$. For example,
the 5th position has $n-5+1 = n-4$ choices. Continuing in this pattern, the
last (kth) entry has $n-k+1$ choices. Therefore


$$P(n,k) = n(n-1)(n-2)\cdots(n-k+1). \qquad (3.2)$$


All together there are k factors in this product, so to compute P(n,k) just
perform $n(n-1)(n-2)(n-3)\cdots$ until you’ve multiplied k numbers. Examples:


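$$\begin{aligned}
P(10,1) &= 10 = 10\\
P(10,2) &= 10\cdot 9 = 90\\
P(10,3) &= 10\cdot 9\cdot 8 = 720\\
P(10,4) &= 10\cdot 9\cdot 8\cdot 7 = 5040\\
&\;\;\vdots\\
P(10,10) &= 10\cdot 9\cdot 8\cdot 7\cdot 6\cdot 5\cdot 4\cdot 3\cdot 2\cdot 1 = 3{,}628{,}800\\
P(10,11) &= 10\cdot 9\cdot 8\cdot 7\cdot 6\cdot 5\cdot 4\cdot 3\cdot 2\cdot 1\cdot 0 = 0.
\end{aligned}$$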


Note P(10,11) = 0, as the 11th factor in the product is 0. This makes 
sense because P(10,11) is the number of non-repetitive length-11 lists made 
from just 10 symbols. There are no such lists, so P(10,11)=0 is right. In 
fact you can check that Equation (3.2) gives P(n,k) =0 whenever k >n. 

Also notice above that P(10,10)=10!. In general P(n,n)=n!. 

We now derive another formula for P(n,k), one that works for $0 \le k \le n$.
Using Equation (3.2) with cancellation and the definition of a factorial,


$$\begin{aligned}
P(n,k) &= n(n-1)(n-2)\cdots(n-k+1)\\
&= \frac{n(n-1)(n-2)\cdots(n-k+1)(n-k)(n-k-1)\cdots 3\cdot 2\cdot 1}{(n-k)(n-k-1)\cdots 3\cdot 2\cdot 1} = \frac{n!}{(n-k)!}.
\end{aligned}$$

To illustrate, let’s find P(8,5) in two ways. Equation (3.2) says
$P(8,5) = 8\cdot 7\cdot 6\cdot 5\cdot 4 = 6720$. By the above formula,
$P(8,5) = \frac{8!}{(8-5)!} = \frac{8!}{3!} = \frac{40{,}320}{6} = 6720$.
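Both ways of computing P(n,k) translate directly into code. In this sketch the function names `P_product` and `P_quotient` are hypothetical; `math.perm` is Python's built-in count of k-permutations (available from Python 3.8) and appears only in the closing check.

```python
from math import factorial, perm

def P_product(n: int, k: int) -> int:
    """Equation (3.2): multiply the k factors n, n-1, ..., n-k+1."""
    result = 1
    for i in range(k):
        result *= n - i
    return result

def P_quotient(n: int, k: int) -> int:
    """The formula n!/(n-k)!, valid only when 0 <= k <= n."""
    if not 0 <= k <= n:
        raise ValueError("the quotient formula needs 0 <= k <= n")
    return factorial(n) // factorial(n - k)

print(P_product(8, 5), P_quotient(8, 5))  # both give 6720
print(P_product(10, 11))                  # 0, since the 11th factor is 0

# The product form agrees with the standard library, including k > n.
assert all(P_product(n, k) == perm(n, k) for n in range(8) for k in range(10))
```

The product form needs no special case for k > n because one of its factors is then 0, whereas the quotient form would involve the factorial of a negative number.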




We summarize these ideas in the following definition and fact. 


Fact 3.4 A k-permutation of an n-element set is a non-repetitive
length-k list made from elements of the set. Informally we think of
a k-permutation as an arrangement of k of the set’s elements in a row.

The number of k-permutations of an n-element set is denoted P(n,k), and

$$P(n,k) = n(n-1)(n-2)\cdots(n-k+1).$$

If $0 \le k \le n$, then $P(n,k) = n(n-1)(n-2)\cdots(n-k+1) = \dfrac{n!}{(n-k)!}$.


Notice that $P(n,0) = \frac{n!}{(n-0)!} = \frac{n!}{n!} = 1$, which makes sense because only one
list of length 0 can be made from n symbols, namely the empty list. Also
$P(0,0) = \frac{0!}{(0-0)!} = \frac{0!}{0!} = \frac{1}{1} = 1$, which is to be expected because there is only one
list of length 0 that can be made with 0 symbols, again the empty list.


Example 3.9 Ten contestants run a marathon. All finish, and there are 
no ties. How many different possible rankings are there for first-, second- 
and third-place? 


Solution: Call the contestants A, B, C, D, E, F, G, H, I and J. A ranking
of winners can be regarded as a 3-permutation of the set of 10 contestants.
For example, ECH means E in first-place, C in second-place and H in third.
Thus there are $P(10,3) = 10\cdot 9\cdot 8 = 720$ possible rankings.


Example 3.10 You deal five cards off of a standard 52-card deck, and line 
them up in a row. How many such lineups are there that either consist of 
all red cards, or all clubs? 


Solution: There are 26 red cards. The number of ways to line up five of
them is $P(26,5) = 26\cdot 25\cdot 24\cdot 23\cdot 22 = 7{,}893{,}600$.


There are 13 club cards (which are black). The number of ways to line up
five of them is $P(13,5) = 13\cdot 12\cdot 11\cdot 10\cdot 9 = 154{,}440$.


By the addition principle, the answer to our question is that there are 
P(26,5)+P(13,5) = 8,048,040 lineups that are either all red cards, or all club 
cards. 
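The arithmetic in Examples 3.9 and 3.10 can be checked as follows. This is only a sketch: the variable names are illustrative, the rankings are enumerated outright because there are few of them, and the card lineups are counted with `math.perm` rather than enumerated because there are millions.

```python
from itertools import permutations
from math import perm

# Example 3.9: a ranking is a 3-permutation of the 10 contestants.
rankings = list(permutations("ABCDEFGHIJ", 3))
print(len(rankings), ("E", "C", "H") in rankings)

# Example 3.10: no lineup is both all red and all clubs,
# so the addition principle applies.
all_red = perm(26, 5)
all_clubs = perm(13, 5)
print(all_red, all_clubs, all_red + all_clubs)
```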


Notice that we do not need to use the notation P(n,k) to solve the problems
on this page. Straightforward applications of the multiplication and
addition principles would suffice. However, the P(n,k) notation often proves 
to be a convenient shorthand. 




Exercises for Section 3.4 


1. What is the smallest n for which n! has more than 10 digits?

2. For which values of n does n! have n or fewer digits?

3. How many 5-digit positive integers are there in which there are no repeated
   digits and all digits are odd?

4. Using only pencil and paper, find the value of 100!/[…].

5. Using only pencil and paper, find the value of 120!/[…].

6. There are two 0’s at the end of 10! = 3,628,800. Using only pencil and paper,
   determine how many 0’s are at the end of the number 100!.

7. Find how many 9-digit numbers can be made from the digits 1, 2, 3, 4, 5, 6, 7,
   8, 9 if repetition is not allowed and all the odd digits occur first (on the left)
   followed by all the even digits (i.e., as in 137598264, but not 123456789).

8. Compute how many 7-digit numbers can be made from the digits 1, 2, 3, 4, 5, 6, 7
   if there is no repetition and the odd digits must appear in an unbroken sequence.
   (Examples: 3571264 or 2413576 or 2467531, etc., but not 7234615.)

9. How many permutations of the letters A, B, C, D, E, F, G are there in which the
   three letters ABC appear consecutively, in alphabetical order?

10. How many permutations of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 are there in which
    the digits alternate even and odd? (For example, 2183470965.)

11. You deal 7 cards off of a 52-card deck and line them up in a row. How many
    possible lineups are there in which not all cards are red?

12. You deal 7 cards off of a 52-card deck and line them up in a row. How many
    possible lineups are there in which no card is a club?

13. How many lists of length six (with no repetition) can be made from the 26 letters
    of the English alphabet?

14. Five of ten books are arranged on a shelf. In how many ways can this be done?

15. In a club of 15 people, we need to choose a president, vice-president, secretary,
    and treasurer. In how many ways can this be done?

16. How many 4-permutations are there of the set {A,B,C,D,E,F} if whenever A
    appears in the permutation, it is followed by E?

17. Three people in a group of ten line up at a ticket counter to buy tickets. How
    many lineups are possible?

18. There is a very interesting function $\Gamma : [0,\infty) \to \mathbb{R}$ called the gamma function.
    It is defined as $\Gamma(x) = \int_0^\infty t^{x-1}e^{-t}\,dt$. It has the remarkable property that if $x \in \mathbb{N}$,
    then $\Gamma(x) = (x-1)!$. Check that this is true for x = 1, 2, 3, 4.

    Notice that this function provides a way of extending factorials to numbers other
    than integers. Since $\Gamma(n) = (n-1)!$ for all $n \in \mathbb{N}$, we have the formula $n! = \Gamma(n+1)$.
    But $\Gamma$ can be evaluated at any number in $[0,\infty)$, not just at integers, so we have
    a formula for n! for any real number $n \in [0,\infty)$. Extra credit: Compute $\pi!$.




3.5 Counting Subsets 


The previous section dealt with counting lists made by selecting k entries 
from a set of n elements. We turn now to a related question: How many 
subsets can be made by selecting k elements from a set with n elements? 

To see the difference between these two problems, take A = {a,b,c,d,e}. 
Consider the non-repetitive lists made from selecting two elements from A. 
Fact 3.4 says there are $P(5,2) = 5\cdot 4 = 20$ such lists, namely


(a,b), (a,c), (a,d), (a,e), (b,c), (b,d), (b,e), (c,d), (c,e), (d,e), 
(b,a), (c,a), (d,a), (e,a), (c,b), (d,b), (e,b), (d,c), (e,c), (e,d). 


But there are only ten 2-element subsets of A. They are
{a,b}, {a,c}, {a,d}, {a,e}, {b,c}, {b,d}, {b,e}, {c,d}, {c,e}, {d,e}.


The reason that there are more lists than subsets is that changing the 
order of the entries of a list produces a different list, but changing the order 
of the elements of a set does not change the set. Using elements $a,b \in A$, we
can make two lists (a,b) and (b,a), but only one subset {a,b}. 
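The contrast between lists and subsets shows up in Python's `itertools` as the difference between `permutations`, where order matters, and `combinations`, where it does not. The sketch below uses illustrative variable names.

```python
from itertools import combinations, permutations

A = ["a", "b", "c", "d", "e"]

lists_of_two = list(permutations(A, 2))     # order matters: (a,b) and (b,a) differ
subsets_of_two = list(combinations(A, 2))   # order ignored: one entry per subset

print(len(lists_of_two), len(subsets_of_two))

# Each 2-element subset accounts for exactly 2! = 2 of the lists.
assert len(lists_of_two) == 2 * len(subsets_of_two)
```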

This section is concerned with counting subsets, not lists. As noted above, 
the basic question is this: How many subsets can be made by choosing k 
elements from an n-element set? We begin with some notation that gives a 
name to the answer to this question. 


Definition 3.2 If n and k are integers, then $\binom{n}{k}$ denotes the number of
subsets that can be made by choosing k elements from an n-element set.
We read $\binom{n}{k}$ as “n choose k.” (Some textbooks write C(n,k) instead of $\binom{n}{k}$.)


This is illustrated in the following table that tallies the k-element subsets 
of the 4-element set A = {a,b,c,d}, for various values of k. 


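  k  | k-element subsets of A = {a,b,c,d}          | $\binom{4}{k}$
 ----+---------------------------------------------+---------------------
 -1  |                                             | $\binom{4}{-1} = 0$
  0  | $\emptyset$                                 | $\binom{4}{0} = 1$
  1  | {a}, {b}, {c}, {d}                          | $\binom{4}{1} = 4$
  2  | {a,b}, {a,c}, {a,d}, {b,c}, {b,d}, {c,d}    | $\binom{4}{2} = 6$
  3  | {a,b,c}, {a,b,d}, {a,c,d}, {b,c,d}          | $\binom{4}{3} = 4$
  4  | {a,b,c,d}                                   | $\binom{4}{4} = 1$
  5  |                                             | $\binom{4}{5} = 0$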




The values of k appear in the far-left column of the table. To the right of
each k are all of the subsets (if any) of A of size k. For example, when k = 1,
set A has four subsets of size k, namely {a}, {b}, {c} and {d}. Therefore
$\binom{4}{1} = 4$. When k = 2 there are six subsets of size k so $\binom{4}{2} = 6$.

When k = 0, there is only one subset of A that has cardinality k, namely
the empty set, $\emptyset$. Therefore $\binom{4}{0} = 1$.

Notice that if k is negative or greater than |A|, then A has no subsets of
cardinality k, so $\binom{4}{k} = 0$ in these cases. In general $\binom{n}{k} = 0$ whenever k < 0 or
k > n. In particular this means $\binom{n}{k} = 0$ if n is negative.

Although it was not hard to work out the values of $\binom{4}{k}$ by writing out
subsets in the above table, this method of actually listing sets would not
be practical for computing $\binom{n}{k}$ when n and k are large. We need a formula.
To find one, we will now carefully work out the value of $\binom{5}{3}$ in a way that
highlights a pattern that points the way to a formula for any $\binom{n}{k}$.

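To begin, note that $\binom{5}{3}$ is the number of 3-element subsets of {a,b,c,d,e}.
These are listed in the top row of the table below, where we see $\binom{5}{3} = 10$.
The column under each subset tallies the 3! = 6 permutations of that subset.
The first subset {a,b,c} has 3! = 6 permutations; these are listed below it.
The second column tallies the permutations of {a,b,d}, and so on.


{a,b,c} {a,b,d} {a,b,e} {a,c,d} {a,c,e} {a,d,e} {b,c,d} {b,c,e} {b,d,e} {c,d,e}
  abc     abd     abe     acd     ace     ade     bcd     bce     bde     cde
  acb     adb     aeb     adc     aec     aed     bdc     bec     bed     ced
  bac     bad     bae     cad     cae     dae     cbd     cbe     dbe     dce
  bca     bda     bea     cda     cea     dea     cdb     ceb     deb     dec
  cab     dab     eab     dac     eac     ead     dbc     ebc     ebd     ecd
  cba     dba     eba     dca     eca     eda     dcb     ecb     edb     edc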


The body of this table has $\binom{5}{3}$ columns and 3! rows, so it has a total of
$3!\binom{5}{3}$ lists. But notice also that the table consists of every 3-permutation of
{a,b,c,d,e}. Fact 3.4 says that there are $P(5,3) = \frac{5!}{(5-3)!}$ such 3-permutations.
Thus the total number of lists in the table can be written as either $3!\binom{5}{3}$ or
$\frac{5!}{(5-3)!}$, which is to say $3!\binom{5}{3} = \frac{5!}{(5-3)!}$. Dividing both sides by 3! yields


$$\binom{5}{3} = \frac{5!}{3!\,(5-3)!}.$$


Working this out, you will find that it does give the correct value of 10.
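The double-counting argument can be replayed mechanically: build one column per 3-element subset, fill it with that subset's orderings, and confirm that the columns together contain every 3-permutation exactly once. In this sketch the names `S`, `table`, `entries` and `three_perms` are illustrative.

```python
from itertools import combinations, permutations
from math import factorial

S = "abcde"

# One column per 3-element subset; each column holds that subset's 3! orderings.
table = {subset: ["".join(p) for p in permutations(subset)]
         for subset in combinations(S, 3)}

entries = [entry for column in table.values() for entry in column]
three_perms = ["".join(p) for p in permutations(S, 3)]

print(len(table), "columns x", factorial(3), "rows =", len(entries), "lists")
```

Because the table entries and the 3-permutations are the same collection, counting them two ways gives the equation that was divided by 3! above.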




But there was nothing special about the values 5 and 3. We could do the
above analysis for any $\binom{n}{k}$ instead of $\binom{5}{3}$. The table would have $\binom{n}{k}$ columns
and k! rows. We would get

$$\binom{n}{k} = \frac{n!}{k!\,(n-k)!}.$$


We have established the following fact, which holds for all $k,n \in \mathbb{Z}$.


Fact 3.5 If $0 \le k \le n$, then $\dbinom{n}{k} = \dfrac{n!}{k!\,(n-k)!}$. Otherwise $\dbinom{n}{k} = 0$.
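Fact 3.5 can be written as a short function. The sketch below uses the hypothetical name `choose`; it is checked against a direct count of subsets and against the standard library's `math.comb`, which handles only non-negative arguments.

```python
from itertools import combinations
from math import comb, factorial

def choose(n: int, k: int) -> int:
    """Fact 3.5: n!/(k!(n-k)!) when 0 <= k <= n, and 0 otherwise."""
    if 0 <= k <= n:
        return factorial(n) // (factorial(k) * factorial(n - k))
    return 0

print([choose(4, k) for k in range(-1, 6)])  # → [0, 1, 4, 6, 4, 1, 0]
print(choose(5, 3))                          # → 10

# Agreement with a direct count of k-element subsets of an n-element set.
for n in range(8):
    for k in range(9):
        assert choose(n, k) == len(list(combinations(range(n), k))) == comb(n, k)
```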


Let’s now use our new knowledge to work some exercises. 


Example 3.11 How many size-4 subsets does {1,2,3,4,5,6,7,8,9} have? 


The answer is $\binom{9}{4} = \frac{9!}{4!\,(9-4)!} = \frac{9!}{4!\,5!} = \frac{9\cdot 8\cdot 7\cdot 6\cdot 5!}{4!\,5!} = \frac{9\cdot 8\cdot 7\cdot 6}{4!} = \frac{9\cdot 8\cdot 7\cdot 6}{24} = 126$.


Example 3.12 How many 5-element subsets of A = {1,2,3,4,5,6,7,8,9} 
have exactly two even elements? 

Solution: Making a 5-element subset of A with exactly two even elements
is a 2-step process. First select two of the four even elements from A. There
are $\binom{4}{2} = 6$ ways to do this. Next, there are $\binom{5}{3} = 10$ ways to select three
of the five odd elements of A. By the multiplication principle, there are
$\binom{4}{2}\binom{5}{3} = 6\cdot 10 = 60$ ways to select two even and three odd elements from A. So
there are 60 5-element subsets of A with exactly two even elements.
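Examples 3.11 and 3.12 involve few enough subsets to count directly. The following sketch does so with `itertools.combinations`; the helper `even_count` and the variable names are assumptions made for illustration.

```python
from itertools import combinations
from math import comb

A = range(1, 10)  # the set {1, 2, ..., 9}

def even_count(subset):
    """Number of even elements in a subset."""
    return sum(1 for x in subset if x % 2 == 0)

size_4 = sum(1 for _ in combinations(A, 4))                           # Example 3.11
two_even = sum(1 for s in combinations(A, 5) if even_count(s) == 2)   # Example 3.12
print(size_4, two_even)
```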


Example 3.13 A single 5-card hand is dealt off of a standard 52-card deck. 
How many different 5-card hands are possible? 


Solution: Think of the deck as a set D of 52 cards. Then a 5-card hand is
just a 5-element subset of D. There are many such subsets, such as


[Figure: a sample five-card hand.]

Thus the number of 5-card hands is the number of 5-element subsets of D,
which is


$$\binom{52}{5} = \frac{52!}{5!\,(52-5)!} = \frac{52!}{5!\,47!} = \frac{52\cdot 51\cdot 50\cdot 49\cdot 48\cdot 47!}{5!\,47!} = \frac{52\cdot 51\cdot 50\cdot 49\cdot 48}{5!} = 2{,}598{,}960.$$


Answer: There are 2,598,960 different five-card hands that can be dealt 
from a deck of 52 cards. 




Example 3.14 This problem concerns 5-card hands that can be dealt off 
of a 52-card deck. How many such hands are there in which two of the cards 
are clubs and three are hearts? 


Solution: Such a hand is described by a list of length two of the form


    ( {two club cards}, {three heart cards} )

where the first entry is a 2-element subset of the set of 13 club cards, and
the second entry is a 3-element subset of the set of 13 heart cards. There
are $\binom{13}{2}$ choices for the first entry and $\binom{13}{3}$ choices for the second, so by
the multiplication principle there are $\binom{13}{2}\binom{13}{3} = \frac{13!}{2!\,11!}\cdot\frac{13!}{3!\,10!} = 22{,}308$ such lists.


Thus there are 22,308 such 5-card hands.


Example 3.15 A lottery features a bucket of 36 balls numbered 1 through 
36. Six balls will be drawn randomly. For $1 you buy a ticket with six blanks: 
☐☐☐☐☐☐. You fill in the blanks with six different numbers between 1
and 36. You win $1,000,000 if you chose the same numbers that are drawn, 
regardless of order. What are your chances of winning? 


Solution: In filling out the ticket you are choosing six numbers from a set of
36 numbers. Thus there are $\binom{36}{6} = \frac{36!}{6!\,(36-6)!} = 1{,}947{,}792$ different combinations
of numbers you might write. Only one of these will be a winner. Your
chances of winning are one in 1,947,792.
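The counts in Examples 3.13 through 3.15 are too large to list but easy to evaluate. This sketch uses `math.comb` for the binomial coefficients and `fractions.Fraction` to keep the winning probability exact; all variable names are illustrative.

```python
from fractions import Fraction
from math import comb

hands = comb(52, 5)                            # Example 3.13
clubs_and_hearts = comb(13, 2) * comb(13, 3)   # Example 3.14
tickets = comb(36, 6)                          # Example 3.15
win_probability = Fraction(1, tickets)

print(hands, clubs_and_hearts, tickets, float(win_probability))
```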


Example 3.16 How many 7-digit binary strings (0010100, 1101011, etc.) 
have an odd number of 1’s? 


Solution: Let A be the set of all 7-digit binary strings with an odd number
of 1’s, so the answer will be |A|. To find |A|, we break A into smaller parts.
Notice any string in A will have either one, three, five or seven 1’s. Let $A_1$ be
the set of 7-digit binary strings with only one 1. Let $A_3$ be the set of 7-digit
binary strings with three 1’s. Let $A_5$ be the set of 7-digit binary strings with
five 1’s, and let $A_7$ be the set of 7-digit binary strings with seven 1’s. Then
$A = A_1 \cup A_3 \cup A_5 \cup A_7$. Any two of the sets $A_i$ have empty intersection, so the
addition principle gives $|A| = |A_1| + |A_3| + |A_5| + |A_7|$.

Now we must compute the individual terms of this sum. Take $A_3$, the
set of 7-digit binary strings with three 1’s. Such a string can be formed by
selecting three out of seven positions for the 1’s and putting 0’s in the other
spaces. Thus $|A_3| = \binom{7}{3}$. Similarly $|A_1| = \binom{7}{1}$, $|A_5| = \binom{7}{5}$, and $|A_7| = \binom{7}{7}$.
Answer: $|A| = |A_1| + |A_3| + |A_5| + |A_7| = \binom{7}{1} + \binom{7}{3} + \binom{7}{5} + \binom{7}{7} = 7 + 35 + 21 + 1 = 64$.
There are 64 7-digit binary strings with an odd number of 1’s.
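With only $2^7 = 128$ binary strings of length seven, Example 3.16 can be confirmed by listing them. The function name `strings_with_odd_ones` in this sketch is hypothetical.

```python
from itertools import product
from math import comb

def strings_with_odd_ones(length: int):
    """All binary strings of the given length having an odd number of 1's."""
    return ["".join(bits) for bits in product("01", repeat=length)
            if bits.count("1") % 2 == 1]

A = strings_with_odd_ones(7)
by_formula = sum(comb(7, j) for j in (1, 3, 5, 7))
print(len(A), by_formula)
```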




Exercises for Section 3.5 


1. Suppose a set A has 37 elements. How many subsets of A have 10 elements?
   How many subsets have 30 elements? How many have 0 elements?

2. Suppose A is a set for which |A| = 100. How many subsets of A have 5 elements?
   How many subsets have 10 elements? How many have 99 elements?

3. A set X has exactly 56 subsets with 3 elements. What is the cardinality of X?

4. Suppose a set B has the property that $|\{X : X \in \mathscr{P}(B), |X| = 6\}| = 28$. Find |B|.

5. How many 16-digit binary strings contain exactly seven 1’s? (Examples of such
   strings include 0111000011110000 and 0011001100110010, etc.)

6. $|\{X \in \mathscr{P}(\{0,1,2,3,4,5,6,7,8,9\}) : |X| = 4\}| =$

7. $|\{X \in \mathscr{P}(\{0,1,2,3,4,5,6,7,8,9\}) : |X| < 4\}| =$

8. This problem concerns lists made from the symbols A, B, C, D, E, F, G, H, I.

   (a) How many length-5 lists can be made if there is no repetition and the list
       is in alphabetical order? (Example: BDEFI or ABCGH, but not BACGH.)

   (b) How many length-5 lists can be made if repetition is not allowed and the
       list is not in alphabetical order?

9. This problem concerns lists of length 6 made from the letters A, B, C, D, E, F,
   without repetition. How many such lists have the property that the D occurs
   before the A?

10. A department consists of 5 men and 7 women. From this department you select
    a committee with 3 men and 2 women. In how many ways can you do this?

11. How many positive 10-digit integers contain no 0’s and exactly three 6’s?

12. Twenty-one people are to be divided into two teams, the Red Team and the Blue
    Team. There will be 10 people on Red Team and 11 people on Blue Team. In
    how many ways can this be done?

13. Suppose $n,k \in \mathbb{Z}$, and $0 \le k \le n$. Use Fact 3.5, the formula $\binom{n}{k} = \frac{n!}{k!\,(n-k)!}$, to show
    that $\binom{n}{k} = \binom{n}{n-k}$.

14. Suppose $n,k \in \mathbb{Z}$, and $0 \le k \le n$. Use Definition 3.2 alone (without using Fact 3.5)
    to show that $\binom{n}{k} = \binom{n}{n-k}$.

15. How many 10-digit binary strings are there that do not have exactly four 1’s?

16. How many 6-element subsets of A = {0,1,2,3,4,5,6,7,8,9} have exactly three even
    elements? How many do not have exactly three even elements?

17. How many 10-digit binary strings are there that have exactly four 1’s or exactly
    five 1’s? How many do not have exactly four 1’s or exactly five 1’s?

18. How many 10-digit binary strings have an even number of 1’s?

19. A 5-card poker hand is called a flush if all cards are the same suit. How many
    different flushes are there?




3.6 Pascal’s Triangle and the Binomial Theorem 


There are some beautiful and significant patterns among the numbers $\binom{n}{k}$.
We now investigate a pattern based on one equation in particular. It happens
that

$$\binom{n+1}{k} = \binom{n}{k-1} + \binom{n}{k} \qquad (3.3)$$

for any integers n and k with $1 \le k \le n$.

To see why this is true, notice that the left-hand side $\binom{n+1}{k}$ is the number
of k-element subsets of the set A = {0,1,2,3,...,n}, which has n+1 elements.
Such a subset either contains 0 or it does not. The $\binom{n}{k-1}$ on the right is the
number of k-element subsets of A that contain 0, because to make such
a subset we can start with {0} and append to it an additional $k-1$ numbers
selected from {1,2,3,...,n}, and there are $\binom{n}{k-1}$ ways to do this. Also, the $\binom{n}{k}$
on the right is the number of k-element subsets of A that do not contain 0, for it is the
number of ways to select k elements from {1,2,3,...,n}. In light of all this,
Equation (3.3) just states the obvious fact that the number of k-element
subsets of A equals the number of k-element subsets that contain 0 plus
the number of k-element subsets that do not contain 0.
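The counting argument for Equation (3.3) can be tested by sorting the k-element subsets of {0,1,...,n} according to whether they contain 0. In the sketch below the function name `split_by_zero` is an illustrative assumption.

```python
from itertools import combinations
from math import comb

def split_by_zero(n: int, k: int):
    """Count k-element subsets of {0,1,...,n} that contain 0 and that do not."""
    with_zero = without_zero = 0
    for subset in combinations(range(n + 1), k):
        if 0 in subset:
            with_zero += 1
        else:
            without_zero += 1
    return with_zero, without_zero

for n in range(1, 8):
    for k in range(1, n + 1):
        with_zero, without_zero = split_by_zero(n, k)
        assert with_zero == comb(n, k - 1)      # subsets containing 0
        assert without_zero == comb(n, k)       # subsets avoiding 0
        assert with_zero + without_zero == comb(n + 1, k)

print(split_by_zero(4, 2))  # → (4, 6)
```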

Having seen why Equation (3.3) is true, we now highlight it by arranging
the numbers $\binom{n}{k}$ in a triangular pattern. The left-hand side of Figure 3.3
shows the numbers $\binom{n}{k}$ arranged in a pyramid with $\binom{0}{0}$ at the apex, just
above a row containing $\binom{1}{k}$ with k = 0 and k = 1. Below this is a row listing
the values of $\binom{2}{k}$ for k = 0, 1, 2, and so on.


Left: Row n of the pyramid lists $\binom{n}{0}, \binom{n}{1}, \ldots, \binom{n}{n}$, for n = 0, 1, ..., 7.
Right: the same pyramid with each value worked out:

                 1
               1   1
             1   2   1
           1   3   3   1
         1   4   6   4   1
       1   5  10  10   5   1
     1   6  15  20  15   6   1
   1   7  21  35  35  21   7   1


Figure 3.3. Pascal’s triangle


Any number $\binom{n+1}{k}$ for $0 < k < n$ in this pyramid is just below and between
the two numbers $\binom{n}{k-1}$ and $\binom{n}{k}$ in the previous row. But Equation (3.3) says
$\binom{n+1}{k} = \binom{n}{k-1} + \binom{n}{k}$. Therefore any number (other than 1) in the pyramid is
the sum of the two numbers immediately above it.




This pattern is especially evident on the right of Figure 3.3, where each
$\binom{n}{k}$ is worked out. Notice how 21 is the sum of the numbers 6 and 15 above
it. Similarly, 5 is the sum of the 1 and 4 above it and so on.

This arrangement is called Pascal’s triangle, after Blaise Pascal, 1623-1662,
a French philosopher and mathematician who discovered many of its
properties. We’ve shown only the first eight rows, but the triangle extends
downward forever. We can always add a new row at the bottom by placing
a 1 at each end and obtaining each remaining number by adding the two
numbers above its position. Doing this in Figure 3.3 (right) gives a new
bottom row

1 8 28 56 70 56 28 8 1.


This row consists of the numbers $\binom{8}{k}$ for $0 \le k \le 8$, and we have computed
them without the formula $\binom{n}{k} = \frac{n!}{k!\,(n-k)!}$. Any $\binom{n}{k}$ can be computed this way.

The very top row (containing only 1) of Pascal’s triangle is called Row 0.
Row 1 is the next down, followed by Row 2, then Row 3, etc. Thus Row n
lists the numbers $\binom{n}{k}$ for $0 \le k \le n$. Exercises 3.5.13 and 3.5.14 established


$$\binom{n}{k} = \binom{n}{n-k} \qquad (3.4)$$


for each $0 \le k \le n$. In words, the kth entry of Row n of Pascal’s triangle
equals the (n-k)th entry. This means that Pascal’s triangle is symmetric
with respect to the vertical line through its apex, as is evident in Figure 3.3.
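The rule for adding a new row (a 1 at each end, sums of neighbours in between) is a small algorithm in its own right. Here is a minimal Python sketch of it; the function names `next_row` and `pascal_rows` are illustrative.

```python
from math import comb

def next_row(row):
    """Build the next row: a 1 at each end, sums of adjacent pairs between."""
    return [1] + [a + b for a, b in zip(row, row[1:])] + [1]

def pascal_rows(last: int):
    """Return Rows 0 through `last` of Pascal's triangle."""
    rows = [[1]]
    for _ in range(last):
        rows.append(next_row(rows[-1]))
    return rows

rows = pascal_rows(8)
print(rows[8])  # → [1, 8, 28, 56, 70, 56, 28, 8, 1]
```

No factorial is ever computed here, only additions, yet Row n reproduces the values $\binom{n}{k}$ and each row reads the same forwards and backwards.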


   n   Row n of Pascal’s triangle   $(x+y)^n$
   0   1                            $1$
   1   1 1                          $1x + 1y$
   2   1 2 1                        $1x^2 + 2xy + 1y^2$
   3   1 3 3 1                      $1x^3 + 3x^2y + 3xy^2 + 1y^3$
   4   1 4 6 4 1                    $1x^4 + 4x^3y + 6x^2y^2 + 4xy^3 + 1y^4$
   5   1 5 10 10 5 1                $1x^5 + 5x^4y + 10x^3y^2 + 10x^2y^3 + 5xy^4 + 1y^5$


Figure 3.4. The $n^{\text{th}}$ row of Pascal’s triangle lists the coefficients of $(x+y)^n$


Notice that Row n appears to be a list of the coefficients of $(x+y)^n$.
For example $(x+y)^2 = 1x^2 + 2xy + 1y^2$, and Row 2 lists the coefficients 1 2 1.
Also $(x+y)^3 = 1x^3 + 3x^2y + 3xy^2 + 1y^3$, and Row 3 is 1 3 3 1. See Figure 3.4,
which suggests that the numbers in Row n are the coefficients of $(x+y)^n$.




In fact this turns out to be true for every n. This fact is known as the 
binomial theorem, and it is worth mentioning here. It tells how to raise 
a binomial x+y to a non-negative integer power n. 


Theorem 3.1 (Binomial Theorem) If n is a non-negative integer, then

$$(x+y)^n = \binom{n}{0}x^n + \binom{n}{1}x^{n-1}y + \binom{n}{2}x^{n-2}y^2 + \binom{n}{3}x^{n-3}y^3 + \cdots + \binom{n}{n-1}xy^{n-1} + \binom{n}{n}y^n.$$

For now we will be content to accept the binomial theorem without proof.
(You will be asked to prove it in an exercise in Chapter 10.) You may find it
useful from time to time. For instance, you can use it if you ever need to
expand an expression such as $(x+y)^7$. To do this, look at Row 7 of Pascal’s
triangle in Figure 3.3 and apply the binomial theorem to get


$$(x+y)^7 = x^7 + 7x^6y + 21x^5y^2 + 35x^4y^3 + 35x^3y^4 + 21x^2y^5 + 7xy^6 + y^7.$$

For another example,


$$\begin{aligned}
(2a-b)^4 &= ((2a)+(-b))^4\\
&= (2a)^4 + 4(2a)^3(-b) + 6(2a)^2(-b)^2 + 4(2a)(-b)^3 + (-b)^4\\
&= 16a^4 - 32a^3b + 24a^2b^2 - 8ab^3 + b^4.
\end{aligned}$$
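The binomial theorem is accepted here without proof, but it is easy to test for small n by multiplying out $(x+y)^n$ one factor at a time and comparing the coefficients with $\binom{n}{k}$. The sketch below does this; the function name `expand_power` and the list representation of a polynomial are assumptions made for illustration.

```python
from math import comb

def expand_power(n: int):
    """Coefficients of (x + y)^n, found by multiplying by (x + y) n times.

    coeffs[k] is the coefficient of x^(n-k) y^k.
    """
    coeffs = [1]  # (x + y)^0 = 1
    for _ in range(n):
        times_x = coeffs + [0]   # multiplying by x keeps each power of y
        times_y = [0] + coeffs   # multiplying by y raises each power of y by one
        coeffs = [a + b for a, b in zip(times_x, times_y)]
    return coeffs

print(expand_power(7))  # → [1, 7, 21, 35, 35, 21, 7, 1]

# Coefficients of (2a - b)^4: substitute x = 2a and y = -b term by term.
print([comb(4, k) * 2 ** (4 - k) * (-1) ** k for k in range(5)])
# → [16, -32, 24, -8, 1]
```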


Exercises for Section 3.6 


1. Write out Row 11 of Pascal’s triangle.

2. Use the binomial theorem to find the coefficient of x^[…] y^[…] in (x + y)^[…].

3. Use the binomial theorem to find the coefficient of x^[…] in (x + 2)^[…].

4. Use the binomial theorem to find the coefficient of x^[…] y^[…] in (3x - 2y)^[…].

5. Use the binomial theorem to show $\sum_{k=0}^{n} \binom{n}{k} = 2^n$.

6. Use Definition 3.2 (page 85) and Fact 1.3 (page 13) to show $\sum_{k=0}^{n} \binom{n}{k} = 2^n$.

7. Use the binomial theorem to show $\sum_{k=0}^{n} 3^k \binom{n}{k} = 4^n$.

8. Use Fact 3.5 (page 87) to derive Equation 3.3 (page 90).

9. Use the binomial theorem to show $\binom{n}{0} - \binom{n}{1} + \binom{n}{2} - \binom{n}{3} + \binom{n}{4} - \cdots + (-1)^n\binom{n}{n} = 0$, for n > 0.

10. Show that the formula $k\binom{n}{k} = n\binom{n-1}{k-1}$ is true for all integers n, k with $0 \le k \le n$.

11. Use the binomial theorem to show $9^n = \sum_{k=0}^{n} (-1)^k \binom{n}{k} 10^{n-k}$.

12. Show that $\binom{n}{k}\binom{k}{m} = \binom{n}{m}\binom{n-m}{k-m}$.

13. Show that $\binom{n}{3} = \binom{2}{2} + \binom{3}{2} + \binom{4}{2} + \binom{5}{2} + \cdots + \binom{n-1}{2}$.

14. The first five rows of Pascal’s triangle appear in the digits of powers of 11: $11^0 = 1$,
    $11^1 = 11$, $11^2 = 121$, $11^3 = 1331$ and $11^4 = 14641$. Why is this so? Why does the
    pattern not continue with $11^5$?




3.7 The Inclusion-Exclusion Principle 


Many counting problems involve computing the cardinality of a union $A \cup B$
of two finite sets. We examine this kind of problem now.

First we develop a formula for $|A \cup B|$. It is tempting to say that $|A \cup B|$
must equal |A|+|B|, but that is not quite right. If we count the elements of
A and then count the elements of B and add the two figures together, we
get |A|+|B|. But if A and B have some elements in common, then we have
counted each element in $A \cap B$ twice.


Therefore |A|+|B| exceeds $|A \cup B|$ by $|A \cap B|$, and consequently
$|A \cup B| = |A| + |B| - |A \cap B|$. This can be a useful equation.


Fact 3.6 Inclusion-Exclusion Formula
If A and B are finite sets, then $|A \cup B| = |A| + |B| - |A \cap B|$.


Notice that the sets A, B and $A \cap B$ are all generally smaller than $A \cup B$,
so Fact 3.6 has the potential of reducing the problem of determining $|A \cup B|$
to three simpler counting problems. It is called the inclusion-exclusion
formula because elements in $A \cap B$ are included (twice) in |A|+|B|, then
excluded when $|A \cap B|$ is subtracted. Notice that if $A \cap B = \emptyset$, then we do
in fact get $|A \cup B| = |A| + |B|$. (This is an instance of the addition principle!)
Conversely, if $|A \cup B| = |A| + |B|$, then it must be that $A \cap B = \emptyset$.
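Python's built-in sets make Fact 3.6 easy to try out, with `&` for intersection and `|` for union. The two sets below are an arbitrary illustration chosen for this sketch, not an example from the text.

```python
A = {n for n in range(1, 31) if n % 2 == 0}   # multiples of 2 up to 30
B = {n for n in range(1, 31) if n % 3 == 0}   # multiples of 3 up to 30

union_size = len(A) + len(B) - len(A & B)      # Fact 3.6
print(len(A), len(B), len(A & B), union_size)  # → 15 10 5 20

assert union_size == len(A | B)
```

The five multiples of 6 lie in both sets, so adding |A| and |B| counts them twice; subtracting $|A \cap B|$ corrects for this.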

Example 3.17 A 3-card hand is dealt off of a standard 52-card deck. How
many different such hands are there for which all three cards are red or all
three cards are face cards?


Solution: Let A be the set of 3-card hands where all three cards are red
(i.e., either ♥ or ♦). Let B be the set of 3-card hands in which all three cards
are face cards (i.e., J, K or Q of any suit). These sets are illustrated below.


[Figure: sample hands from A (red cards) and from B (face cards).]


We seek the number of 3-card hands that are all red or all face cards, and
this number is $|A \cup B|$. By Fact 3.6, $|A \cup B| = |A| + |B| - |A \cap B|$. Let’s examine
|A|, |B| and $|A \cap B|$ separately. Any hand in A is formed by selecting three
cards from the 26 red cards in the deck, so $|A| = \binom{26}{3}$. Similarly, any hand in
B is formed by selecting three cards from the 12 face cards in the deck, so
$|B| = \binom{12}{3}$. Now think about $A \cap B$. It contains all the 3-card hands made up
of cards that are red face cards.


[Figure: sample hands from $A \cap B$ (red face cards).]


The deck has only 6 red face cards, so $|A \cap B| = \binom{6}{3}$.


Now we can answer our question. The number of 3-card hands that are
all red or all face cards is $|A \cup B| = |A| + |B| - |A \cap B| = \binom{26}{3} + \binom{12}{3} - \binom{6}{3} = 2600 +
220 - 20 = 2800$.


Example 3.18 A 3-card hand is dealt off of a standard 52-card deck. How 
many different such hands are there for which it is not the case that all 3 
cards are red or all three cards are face cards? 


Solution: We will use the subtraction principle combined with our answer
to Example 3.17, above. The total number of 3-card hands is
$\binom{52}{3} = \frac{52!}{3!\,49!} = \frac{52\cdot 51\cdot 50}{3\cdot 2} = 26\cdot 17\cdot 50 = 22{,}100$.
To get our answer, we must subtract from
this the number of 3-card hands that are all red or all face cards, that is,
we must subtract the answer from Example 3.17. Thus the answer to our
question is 22,100 − 2800 = 19,300.
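The counts in Examples 3.17 and 3.18 are small enough to confirm by listing every 3-card hand. The following is a minimal sketch of such a check; the card labels and the names `DECK`, `is_red`, `is_face` and `count_hands` are illustrative assumptions, not notation from the text.

```python
from itertools import combinations

# Model the deck as (rank, suit) pairs. The labels are arbitrary (assumed).
RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
SUITS = ["hearts", "diamonds", "clubs", "spades"]
DECK = [(rank, suit) for rank in RANKS for suit in SUITS]


def is_red(card):
    return card[1] in ("hearts", "diamonds")


def is_face(card):
    return card[0] in ("J", "Q", "K")


def count_hands():
    """Return (hands that are all red or all face, hands that are neither)."""
    either = 0
    total = 0
    for hand in combinations(DECK, 3):
        total += 1
        if all(is_red(c) for c in hand) or all(is_face(c) for c in hand):
            either += 1
    return either, total - either


either, neither = count_hands()
print(either, neither)  # 2800 19300
```

The first number matches the inclusion-exclusion answer of Example 3.17, and the second matches the subtraction-principle answer of Example 3.18.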


There is an analogue of Fact 3.6 that involves three sets. Consider three 
sets A, B and C, as represented in the following Venn Diagram. 


[Figure: Venn diagram of three overlapping sets A, B and C.]


Using the same kind of reasoning that resulted in Fact 3.6, you can convince 
yourself that 


|A ∪ B ∪ C| = |A| + |B| + |C| − |A ∩ B| − |A ∩ C| − |B ∩ C| + |A ∩ B ∩ C|.     (3.5)
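One way to convince yourself of Equation (3.5) is to test it on many randomly generated sets. The sketch below does this with Python's built-in sets; the function names, the universe size and the seed are arbitrary choices made for illustration. Passing these checks is evidence, not a proof.

```python
import random


def inclusion_exclusion_3(a, b, c):
    """Right-hand side of Equation (3.5) for finite sets a, b, c."""
    return (len(a) + len(b) + len(c)
            - len(a & b) - len(a & c) - len(b & c)
            + len(a & b & c))


def random_subset(universe, rng):
    """Pick each element of the universe independently with probability 1/2."""
    return {x for x in universe if rng.random() < 0.5}


rng = random.Random(0)  # fixed seed so the run is repeatable
universe = range(30)
for _ in range(1000):
    a, b, c = (random_subset(universe, rng) for _ in range(3))
    # Left-hand side of (3.5) is the size of the union.
    assert inclusion_exclusion_3(a, b, c) == len(a | b | c)
```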




There’s probably not much harm in ignoring this one for now, but if you 
find this kind of thing intriguing you should definitely take a course in 
combinatorics. (Ask your instructor!) 


Exercises for Section 3.7 


1. At a certain university 523 of the seniors are history majors or math majors (or
both). There are 100 senior math majors, and 33 seniors are majoring in both
history and math. How many seniors are majoring in history?

2. How many 4-digit positive integers are there for which there are no repeated
digits, or for which there may be repeated digits, but all digits are odd?

3. How many 4-digit positive integers are there that are even or contain no 0's?

4. This problem involves lists made from the letters T, H, E, O, R, Y, with repetition
allowed.
(a) How many 4-letter lists are there that don't begin with T, or don't end in Y?
(b) How many 4-letter lists are there in which the sequence of letters T, H, E
appears consecutively (in that order)?
(c) How many 6-letter lists are there in which the sequence of letters T, H, E
appears consecutively (in that order)?

5. How many 7-digit binary strings begin in 1 or end in 1 or have exactly four 1's?

6. Is the following statement true or false? Explain. If A₁ ∩ A₂ ∩ A₃ = ∅, then
|A₁ ∪ A₂ ∪ A₃| = |A₁| + |A₂| + |A₃|.

7. Consider 4-card hands dealt off of a standard 52-card deck. How many hands
are there for which all 4 cards are of the same suit or all 4 cards are red?

8. Consider 4-card hands dealt off of a standard 52-card deck. How many hands
are there for which all 4 cards are of different suits or all 4 cards are red?

9. A 4-letter list is made from the letters L, I, S, T, E, D according to the following
rule: Repetition is allowed, and the first two letters on the list are vowels or the
list ends in D. How many such lists are possible?

10. How many 6-digit numbers are even or are divisible by 5?

11. How many 7-digit numbers are even or have exactly three digits equal to 0?

12. How many 5-digit numbers are there in which three of the digits are 7, or two of
the digits are 2?

13. How many 8-digit binary strings end in 1 or have exactly four 1's?

14. How many 3-card hands (from a standard 52-card deck) have the property that
it is not the case that all cards are black or all cards are of the same suit?

15. How many 10-digit binary strings begin in 1 or end in 1?




3.8 Counting Multisets 


You have in your pocket four pennies, two nickels, a dime and two quarters. 
You might be tempted to regard this collection as a set 


{1,1,1,1,5,5, 10,25, 25}. 


But this is not a valid model of your collection of change, because a set 
cannot have repeated elements. To overcome this difficulty, we make a new 
construction called a multiset. A multiset is like a set, except that elements 
can be repeated. We will use square brackets [ ] instead of braces {} to denote 
multisets. For example, your multiset of change is 


[1,1,1,1,5,5,10,25,25].


A multiset is a hybrid of a set and a list; in a multiset, elements can be 
repeated, but order does not matter. For instance 


[1,1,1,1,5,5,10,25,25] = [25,5,1,1,10,1,1,5,25] 
= [25,10,25,1,5,1,5,1,1]. 


Given a multiset A, its cardinality |A| is the number of elements it
has, including repetition. So if A = [1,1,1,1,5,5,10,25,25], then |A| = 9. The
multiplicity of an element x ∈ A is the number of times that x appears, so
1 ∈ A has multiplicity 4, while 5 and 25 each have multiplicity 2, and 10 has
multiplicity 1. Notice that every set can be regarded as a multiset for which
each element has multiplicity 1. In this sense we can think of ∅ = {} = [] as
the multiset that has no elements.

To illustrate the idea of multisets, consider the multisets of cardinality 2 
that can be made from the symbols {a,b,c,d}. They are 


[a,a] [a,b] [a,c] [a,d] [b,b] [b,c] [b,d] [c,c] [c,d] [d,d].


We have listed them so that the letters in each multiset are in alphabetical 
order (remember, we can order the elements of a multiset in any way we 
choose), and the 10 multisets are arranged in dictionary order. 

For multisets of cardinality 3 made from {a,b,c,d}, we have 


[a,a,a] [a,a,b] [a,a,c] [a,a,d] [a,b,b] 
[a,b,c] [a,b,d] [a,c,c] [a,c,d] [a,d,d] 
[b,b,b] [b,b,c] [b,b,d] [b,c,c] [b,c,d] 
[b,d,d] [c,c,c] [c,c,d] [c,d,d] [d,d,d].




Though X = {a,b,c,d} has no subsets of cardinality 5, there are many 
multisets of cardinality 5 made from these elements, including [a,a,a,a,a], 
[a,a,b,c,d] and [b,c,c,d,d], and so on. Exactly how many are there? 

This is the first question about multisets that we shall tackle: Given a 
finite set X, how many cardinality-k multisets can be made from X? 

Let’s start by counting the cardinality-5 multisets made from symbols 
X = {a,b,c,d}. (Our approach will lead to a general formula.) We know we 
can write any such multiset with its letters in alphabetical order. Tweaking 
the notation slightly, we could write any such multiset with bars separating 
the groupings of a,b,c,d, as shown in the table below. Notice that if a 
symbol does not appear in the multiset, we still write the bar that would 
have separated it from the others. 


Multiset        with separating bars    encoding
[a,a,b,c,d]     aa|b|c|d                **|*|*|*
[a,b,b,c,d]     a|bb|c|d                *|**|*|*
[a,b,c,c,d]     a|b|cc|d                *|*|**|*
[a,a,c,c,d]     aa||cc|d                **||**|*
[b,b,d,d,d]     |bb||ddd                |**||***
[a,a,a,a,a]     aaaaa|||                *****|||


This suggests that we can encode the multisets as lists made from the two
symbols * and |, with a * for each element of the multiset, as follows.


(* for each a) | (* for each b) | (* for each c) | (* for each d)


For examples see the right-hand column of the table. Any such encoding is
a list made from 5 stars and 3 bars, so the list has a total of 8 entries. How
many such lists are there? We can form such a list by choosing 3 of the 8
positions for the bars, and filling the remaining five positions with stars.
Therefore the number of such lists is $\binom{8}{3} = \frac{8!}{3!\,5!} = 56$.

That is our answer. There are 56 cardinality-5 multisets that can
be made from the symbols in X = {a,b,c,d}.


If we wanted to count the cardinality-3 multisets made from X, then the
exact same reasoning would apply, but with 3 stars instead of 5. We'd be
counting the length-6 lists with 3 stars and 3 bars. There are
$\binom{6}{3} = \frac{6!}{3!\,3!} = 20$
such lists. So there are 20 cardinality-3 multisets made from X = {a,b,c,d}.
This agrees with our accounting on the previous page.
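The star-and-bar encoding described above is mechanical enough to write as a short program. Here is one possible sketch; the function names `encode` and `decode` are hypothetical, and the symbols are assumed to be given in the order in which their groupings appear.

```python
from collections import Counter


def encode(multiset, symbols):
    """Encode a multiset as a star-and-bar string.

    One star per element, with a bar between consecutive symbol groupings.
    """
    counts = Counter(multiset)  # a missing symbol counts as 0
    return "|".join("*" * counts[s] for s in symbols)


def decode(code, symbols):
    """Recover the multiset (as a sorted list) from a star-and-bar string."""
    groups = code.split("|")
    return [s for s, group in zip(symbols, groups) for _ in group]


print(encode(list("bbddd"), "abcd"))   # |**||***
print(encode(list("aaaaa"), "abcd"))   # *****|||
print(decode("**||**|*", "abcd"))      # ['a', 'a', 'c', 'c', 'd']
```

Because `decode` undoes `encode`, each multiset corresponds to exactly one star-and-bar list, which is what lets us count lists instead of multisets.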




In general, given a set X = {x₁, x₂, ..., xₙ} of n elements, any cardinality-k
multiset made from its elements can be encoded in a star-and-bar list

(* for each x₁) | (* for each x₂) | (* for each x₃) | ··· | (* for each xₙ)

Such a list has k stars (one for each element of the multiset) and n−1
separating bars (a bar between each of the n groupings of stars). Therefore
its length is k+n−1. We can make such a list by selecting n−1 list positions
out of k+n−1 positions for the bars and inserting stars in the left-over
positions. Thus there are $\binom{k+n-1}{n-1}$ such lists. Alternatively we could choose k
positions for the stars and fill in the remaining n−1 with bars, so there are
$\binom{k+n-1}{k}$ such lists. Note that $\binom{k+n-1}{k} = \binom{k+n-1}{n-1}$ by Equation (3.4) on page 91.


Let’s summarize our reckoning. 


Fact 3.7. The number of k-element multisets that can be made from the
elements of an n-element set X = {x₁, x₂, ..., xₙ} is

$$\binom{k+n-1}{k} = \binom{k+n-1}{n-1}.$$


This works because any cardinality-k multiset made from the n elements
of X can be encoded in a star-and-bar list of length k+n−1, having form

(* for each x₁) | (* for each x₂) | (* for each x₃) | ··· | (* for each xₙ)

with k stars and n−1 bars separating the n groupings of stars. Such a
list can be made by selecting n−1 positions for the bars, and filling the
remaining positions with stars, and there are $\binom{k+n-1}{n-1}$ ways to do this.


For example, the number of 2-element multisets that can be made from
the 4-element set X = {a,b,c,d} is $\binom{2+4-1}{2} = \binom{5}{2} = 10$. This agrees with our
accounting of them on page 96. The number of 3-element multisets that
can be made from the elements of X is $\binom{3+4-1}{3} = \binom{6}{3} = 20$. Again this agrees
with our list of them on page 96.

The number of 1-element multisets made from X is $\binom{1+4-1}{1} = \binom{4}{1} = 4$.
Indeed, the four multisets are [a], [b], [c] and [d]. The number of 0-element
multisets made from X is $\binom{0+4-1}{0} = \binom{3}{0} = 1$. This is right, because there is
only one such multiset, namely ∅.
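Fact 3.7 can also be checked by generating the multisets directly. The sketch below relies on the standard library's `itertools.combinations_with_replacement`, which yields each k-element multiset of its input exactly once (as a sorted tuple); the function name `count_multisets` is an illustrative assumption.

```python
from itertools import combinations_with_replacement
from math import comb


def count_multisets(n, k):
    """Count the k-element multisets made from an n-element set by listing them."""
    symbols = range(n)
    return sum(1 for _ in combinations_with_replacement(symbols, k))


# Compare the brute-force count with the formula of Fact 3.7.
for n in range(1, 6):
    for k in range(0, 7):
        assert count_multisets(n, k) == comb(k + n - 1, k)

print(count_multisets(4, 5), count_multisets(4, 3), count_multisets(4, 2))  # 56 20 10
```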




Example 3.19 A bag contains 20 identical red marbles, 20 identical green 
marbles, and 20 identical blue marbles. You reach in and grab 20 marbles. 
There are many possible outcomes. You could have 11 reds, 4 greens and 5 
blues. Or you could have 20 reds, 0 greens and 0 blues, etc. All together, 
how many outcomes are possible? 


Solution: Each outcome can be thought of as a 20-element multiset made
from the elements of the 3-element set X = {R, G, B}. For example, 11 reds, 4
greens and 5 blues would correspond to the multiset

[R,R,R,R,R,R,R,R,R,R,R,G,G,G,G,B,B,B,B,B].

The outcome consisting of 10 reds and 10 blues corresponds to the multiset

[R,R,R,R,R,R,R,R,R,R,B,B,B,B,B,B,B,B,B,B].

Thus the total number of outcomes is the number of 20-element multisets
made from the elements of the 3-element set X = {R, G, B}. By Fact 3.7, the
answer is $\binom{20+3-1}{20} = \binom{22}{20} = 231$ possible outcomes.


Rather than remembering the formula in Fact 3.7, it is probably best 
to work out a new stars-and-bars model as needed. This is because it is 
often easy to see how a particular problem can be modeled with stars and 
bars, and once they have been set up, the formula in Fact 3.7 falls out 
automatically. 

For instance, we could solve Example 3.19 by noting that each outcome
has a star-and-bar encoding using 20 stars and 2 bars. (The outcome
[R,R,R,R,R,R,R,R,R,R,R,G,G,G,G,B,B,B,B,B] can be encoded in stars and bars as
***********|****|*****, etc.) We can form such a list by choosing
2 out of 22 slots for bars and filling the remaining 20 slots with stars. There
are $\binom{22}{2} = 231$ ways of doing this.
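The answer to Example 3.19 can be double-checked by listing the outcomes as triples (reds, greens, blues). This is only a sketch; the name `marble_outcomes` and its parameters are hypothetical. The `per_color` cap plays no role here because the bag holds 20 marbles of every color, but it would matter if some color were in short supply.

```python
from math import comb


def marble_outcomes(total=20, per_color=20):
    """List every (reds, greens, blues) with the given total and supply cap."""
    return [(r, g, total - r - g)
            for r in range(min(total, per_color) + 1)
            for g in range(min(total - r, per_color) + 1)
            if total - r - g <= per_color]


outcomes = marble_outcomes()
print(len(outcomes), comb(22, 2))  # 231 231
```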

Our next example involves counting the number of non-negative integer
solutions of the equation w+x+y+z = 20. By a non-negative integer solution
to the equation, we mean an assignment of non-negative integers to the
variables that makes the equation true. For example, one solution is w = 7, x = 3,
y = 5, z = 5. We can write this solution compactly as (w,x,y,z) = (7,3,5,5).
Two other solutions are (w,x,y,z) = (1,3,1,15) and (w,x,y,z) = (0,20,0,0). We
would not include (w,x,y,z) = (1,−1,10,10) as a solution because even though
it satisfies the equation, the value of x is negative. How many solutions are
there all together? The next example presents a way of solving this type of
question.




Example 3.20 How many non-negative integer solutions does the equation
w+x+y+z = 20 have?


Solution: We can model a solution with stars and bars. For example,
encode the solution (w,x,y,z) = (3,4,5,8) as

***|****|*****|********

with groupings of 3, 4, 5 and 8 stars. In general, any solution
(w,x,y,z) = (a,b,c,d) gets encoded as

(a stars) | (b stars) | (c stars) | (d stars)

where all together there are 20 stars and 3 bars. So, for instance the solution
(w,x,y,z) = (0,0,10,10) gets encoded as ||**********|**********, and
the solution (w,x,y,z) = (7,3,5,5) is encoded as *******|***|*****|*****.
Thus we can describe any non-negative integer solution to the equation as
a list of length 20+3 = 23 that has 20 stars and 3 bars. We can make any
such list by choosing 3 out of 23 spots for the bars, and filling the remaining
20 spots with stars. The number of ways to do this is
$\binom{23}{3} = \frac{23!}{3!\,20!} = \frac{23\cdot 22\cdot 21}{3\cdot 2} = 23\cdot 11\cdot 7 = 1771$.
Thus there are 1771 non-negative integer solutions of
w+x+y+z = 20.


For another approach to this example, model solutions of w+x+y+z = 20
as 20-element multisets made from the elements of {w,x,y,z}. For example,
solution (5,5,4,6) corresponds to [w,w,w,w,w,x,x,x,x,x,y,y,y,y,z,z,z,z,z,z].
By Fact 3.7, there are $\binom{20+4-1}{20} = \binom{23}{20} = 1771$ such multisets, so this is the
number of solutions to w+x+y+z = 20.
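Since the numbers in Example 3.20 are modest, the count can be confirmed by direct enumeration. In the sketch below (the function name `count_solutions` is an assumption), the loops choose w, x and y, and z is then forced to be whatever remains.

```python
from math import comb


def count_solutions(total):
    """Count non-negative integer solutions of w + x + y + z = total."""
    count = 0
    for w in range(total + 1):
        for x in range(total - w + 1):
            for y in range(total - w - x + 1):
                # z = total - w - x - y is determined and is non-negative.
                count += 1
    return count


print(count_solutions(20), comb(23, 3))  # 1771 1771
```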


Example 3.21 This problem concerns the lists (w,x,y,z) of integers with
the property that 0 ≤ w ≤ x ≤ y ≤ z ≤ 10. That is, each entry is an integer
between 0 and 10, and the entries are ordered from smallest to largest. For
example, (0,3,3,7), (1,1,1,1) and (2,3,6,9) have this property, but (2,3,6,4)
does not. How many such lists are there?


Solution: We can encode such a list with 10 stars and 4 bars, where w is 
the number of stars to the left of the first bar, x is the number of stars to 
the left of the second bar, y is the number of stars to the left of the third 
bar, and z is the number of stars to the left of the fourth bar. 

For example, (2,3,6,9) is encoded as **|*|***|***|*, and (1,2,3,4) is
encoded as *|*|*|*|******.




Here are some other examples of lists paired with their encodings. 


(0,3,3,7)      |***||****|***
(1,1,1,1)      *||||*********
(9,9,9,10)     *********|||*|


Such encodings are lists of length 14, with 10 stars and 4 bars. We can
make such a list by choosing 4 of the 14 slots for the bars and filling the
remaining slots with stars. The number of ways to do this is $\binom{14}{4} = 1001$.
Answer: There are 1001 such lists.
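Both the encoding and the final count in Example 3.21 can be reproduced in a few lines. The following is a sketch with hypothetical function names; `encode_list` places each bar so that the number of stars to its left equals the corresponding entry of the list.

```python
from itertools import product
from math import comb


def encode_list(entries, top=10):
    """Encode a non-decreasing list with entries in 0..top as stars and bars."""
    parts = []
    previous = 0
    for value in entries:
        parts.append("*" * (value - previous))  # stars added before this bar
        previous = value
    parts.append("*" * (top - previous))        # stars after the last bar
    return "|".join(parts)


def count_ordered_lists(length=4, top=10):
    """Count lists of the given length with 0 <= entries <= top, non-decreasing."""
    return sum(1 for t in product(range(top + 1), repeat=length)
               if all(t[i] <= t[i + 1] for i in range(length - 1)))


print(encode_list((2, 3, 6, 9)))          # **|*|***|***|*
print(count_ordered_lists(), comb(14, 4))  # 1001 1001
```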


We will examine one more type of multiset problem. To motivate it,
consider the permutations of the letters of the word "BOOK." At first glance
there are 4 letters, so we should get 4! = 24 permutations. But this is not
quite right because two of the letters are identical. We could interchange the
two O's but still have the same permutation. To get a grip on the problem,
let's make one of the letters lower case: BOoK. Now our 24 permutations
are listed in the first two rows below.


BOoK KOoB OoKB OoBK OBoK OKoB OKBo OBKo BKOo KBOo KOBo BOKo
BoOK KoOB oOKB oOBK oBOK oKOB oKBO oBKO BKoO KBoO KoBO BoKO

BOOK KOOB OOKB OOBK OBOK OKOB OKBO OBKO BKOO KBOO KOBO BOKO


The two entries in each column correspond to the same permutation of the
letters of BOOK, as indicated in the bottom row. Thus there are actually
24/2 = 12 permutations of the letters of BOOK.

This is actually a problem about multisets. The letters in "BOOK" form a
multiset [B,O,O,K], and we have determined that there are 12 permutations
of this multiset.

For another motivational example, consider the permutations of the 
letters of the word BANANA. Here there are two N’s and three A’s. Though 
some of the letters look identical, think of them as distinct physical objects 
that we can permute into different orderings. It helps to subscript the 
letters to emphasize that they are actually six distinct objects: 


B A₁ N₁ A₂ N₂ A₃.


Now, there are 6! = 720 permutations of these six letters. It’s not practical 
to write out all of them, but we can get a sense of the problem by making a 
partial listing in the box below. 


        720 permutations of B A₁ N₁ A₂ N₂ A₃

B A₁ N₁ A₂ N₂ A₃        A₁ B N₁ A₂ N₂ A₃
B A₁ N₁ A₃ N₂ A₂        A₁ B N₁ A₃ N₂ A₂
B A₂ N₁ A₁ N₂ A₃        A₂ B N₁ A₁ N₂ A₃
B A₂ N₁ A₃ N₂ A₁        A₂ B N₁ A₃ N₂ A₁
B A₃ N₁ A₂ N₂ A₁        A₃ B N₁ A₂ N₂ A₁
B A₃ N₁ A₁ N₂ A₂        A₃ B N₁ A₁ N₂ A₂
B A₁ N₂ A₂ N₁ A₃        A₁ B N₂ A₂ N₁ A₃
B A₁ N₂ A₃ N₁ A₂        A₁ B N₂ A₃ N₁ A₂
B A₂ N₂ A₁ N₁ A₃        A₂ B N₂ A₁ N₁ A₃
B A₂ N₂ A₃ N₁ A₁        A₂ B N₂ A₃ N₁ A₁
B A₃ N₂ A₂ N₁ A₁        A₃ B N₂ A₂ N₁ A₁
B A₃ N₂ A₁ N₁ A₂        A₃ B N₂ A₁ N₁ A₂

     BANANA                  ABNANA


The first column lists the permutations of B A₁ N₁ A₂ N₂ A₃ corresponding to
the word BANANA. By the multiplication principle, the column has 3!2! = 12
permutations because the three Aᵢ's can be permuted in 3! ways within
their positions, and the two Nᵢ's can be permuted in 2! ways. Similarly, the
second column lists the 3!2! = 12 permutations corresponding to the "word"
ABNANA.

All together there are 6! = 720 permutations of B A₁ N₁ A₂ N₂ A₃, and
groupings of 12 of them correspond to particular permutations of BANANA.
Therefore the total number of permutations of BANANA is
$\frac{6!}{3!\,2!} = \frac{720}{12} = 60$.


The kind of reasoning used here generalizes to the following fact. 


Fact 3.8 Suppose a multiset A has n elements, with multiplicities
p₁, p₂, ..., pₖ. Then the total number of permutations of A is

$$\frac{n!}{p_1!\,p_2!\cdots p_k!}.$$


Example 3.22 Count the permutations of the letters in MISSISSIPPI. 


Solution: Think of this word as an 11-element multiset with one M, four
I's, four S's and two P's. By Fact 3.8, it has $\frac{11!}{1!\,4!\,4!\,2!} = 34{,}650$ permutations.


Example 3.23 Determine the number of permutations of the multiset 
[1,1,1,1,5,5, 10, 25,25]. 


Solution: By Fact 3.8 the answer is $\frac{9!}{4!\,2!\,1!\,2!} = 3780$.
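Fact 3.8 translates directly into a small function, and for short words the result can be compared against a brute-force listing. The sketch below uses hypothetical names; `brute_force` generates all n! orderings and discards duplicates, so it is only practical for small inputs.

```python
from collections import Counter
from itertools import permutations
from math import factorial


def multiset_permutations(items):
    """Number of permutations of a multiset: n! / (p1! p2! ... pk!)."""
    result = factorial(len(items))
    for multiplicity in Counter(items).values():
        result //= factorial(multiplicity)  # each division is exact
    return result


def brute_force(items):
    """Count distinct orderings by listing them all (small inputs only)."""
    return len(set(permutations(items)))


print(multiset_permutations("BOOK"), brute_force("BOOK"))      # 12 12
print(multiset_permutations("BANANA"), brute_force("BANANA"))  # 60 60
print(multiset_permutations("MISSISSIPPI"))                    # 34650
print(multiset_permutations([1, 1, 1, 1, 5, 5, 10, 25, 25]))   # 3780
```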




Exercises for Section 3.8 


1. How many 10-element multisets can be made from the symbols {1,2,3,4}?

2. How many 2-element multisets can be made from the 26 letters of the alphabet?

3. You have a dollar in pennies, a dollar in nickels, a dollar in dimes, and a dollar
in quarters. You give a friend four coins. How many ways can this be done?

4. A bag contains 20 identical red balls, 20 identical blue balls, 20 identical green
balls, and 20 identical white balls. You reach in and grab 15 balls. How many
different outcomes are possible?

5. A bag contains 20 identical red balls, 20 identical blue balls, 20 identical green
balls, and one white ball. You reach in and grab 15 balls. How many different
outcomes are possible?

6. A bag contains 20 identical red balls, 20 identical blue balls, 20 identical green
balls, one white ball, and one black ball. You reach in and grab 20 balls. How
many different outcomes are possible?

7. In how many ways can you place 20 identical balls into five different boxes?

8. How many lists (x,y,z) of three integers are there with 0 ≤ x ≤ y ≤ z ≤ 100?

9. A bag contains 50 pennies, 50 nickels, 50 dimes and 50 quarters. You reach in
and grab 30 coins. How many different outcomes are possible?

10. How many non-negative integer solutions does u+v+w+x+y+z = 90 have?

11. How many integer solutions does the equation w+x+y+z = 100 have if w ≥ 4,
x ≥ 2, y ≥ 0 and z ≥ 0?

12. How many integer solutions does the equation w+x+y+z = 100 have if w ≥ 7,
x ≥ 0, y ≥ 5 and z ≥ 4?

13. How many length-6 lists can be made from the symbols {A, B, C, D, E, F, G}, if
repetition is allowed and the list is in alphabetical order? (Examples: BBCEGG,
but not BBBAGG.)

14. How many permutations are there of the letters in the word "PEPPERMINT"?

15. How many permutations are there of the letters in the word "TENNESSEE"?

16. A community in Canada's Northwest Territories is known in the local language
as "TUKTUYAAQTUUQ." How many permutations does this name have?

17. You roll a dice six times in a row. How many possible outcomes are there that
have two 1's, three 5's and one 6?

18. Flip a coin ten times in a row. How many outcomes have 3 heads and 7 tails?

19. In how many ways can you place 15 identical balls into 20 different boxes if each
box can hold at most one ball?

20. You distribute 25 identical pieces of candy among five children. In how many
ways can this be done?

21. How many numbers between 10,000 and 99,999 contain one or more of the digits
3, 4 and 8, but no others?




3.9 The Division and Pigeonhole Principles 


Our final fundamental counting principle is called the division principle.
Before discussing it, we need some notation. Given a number x, its floor
⌊x⌋ is x rounded down to the nearest integer. Thus ⌊10/4⌋ = 2, and ⌊9.31⌋ = 9,
and ⌊7⌋ = 7, etc. The ceiling of x, denoted ⌈x⌉, is x rounded up to the nearest
integer. Thus ⌈10/4⌉ = 3, and ⌈9.31⌉ = 10, and ⌈7⌉ = 7.

The division principle is often illustrated by a simple situation involving
pigeons. Imagine n pigeons that live in k pigeonholes, or boxes. (Possibly
n ≠ k.) At night all the pigeons fly into the boxes. When this happens, some
of the k boxes may contain more than one pigeon, and some may be empty.
But no matter what, the average number of pigeons per box is n/k. Obviously,
at least one of the boxes contains n/k or more pigeons. (Because not all the
boxes can contain fewer than the average number of pigeons per box.) And
because a box must contain a whole number of pigeons, we round up to
conclude that at least one box contains ⌈n/k⌉ or more pigeons.

Similarly, at least one box must contain n/k or fewer pigeons, because not
all boxes can contain more than the average number of pigeons per box.
Rounding down, at least one box contains ⌊n/k⌋ or fewer pigeons.

We call this line of reasoning the division principle. (Some texts call it 
the strong form of the pigeonhole principle.) 


Fact 3.9 (Division Principle)

Suppose n objects are placed into k boxes.

Then at least one box contains ⌈n/k⌉ or more objects,
and at least one box contains ⌊n/k⌋ or fewer objects.


This has a useful variant. If n > k, then n/k > 1, so ⌈n/k⌉ > 1, and this means
some box contains more than one object. On the other hand, if n < k then
n/k < 1, so ⌊n/k⌋ < 1, meaning at least one box is empty. Thus the division
principle yields the following consequence, called the pigeonhole principle.


Fact 3.10 (Pigeonhole Principle) 

Suppose n objects are placed into k boxes. 

If n > k, then at least one box contains more than one object.
If n < k, then at least one box is empty.


The pigeonhole principle is named for the scenario in which n pigeons
fly into k pigeonholes (or boxes). If there are more pigeons than boxes (n > k)
then some box gets more than one pigeon. And if there are fewer pigeons
than boxes (n < k) then there must be at least one empty box.

Like the multiplication, addition and subtraction principles, the division 
and pigeonhole principles are intuitive and obvious, but they can prove 
things that are not obvious. The challenge is seeing where and how to apply 
them. Our examples will start simple and get progressively more complex. 


For an extremely simple application, notice that in any group of 13
people, at least two of them were born in the same month. Although this is
obvious, it really does follow from the pigeonhole principle. Think of the 13 
people as objects, and put each person in the “box” that is his birth month. 
As there are more people than boxes (months), at least one box (month) has 
two or more people in it, meaning at least two of the 13 people were born in 
the same month. 

Further, for any group of 100 people, the division principle says that
there is a month in which ⌈100/12⌉ = 9 or more of the people were born. It also
guarantees a month in which ⌊100/12⌋ = 8 or fewer of the people were born.
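The division principle can be watched in action by scattering objects into boxes at random and inspecting the fullest and emptiest box. The sketch below is an illustration under assumed names (`division_bounds`, `check_random_placements`) and an arbitrary seed; it uses integer arithmetic for the floor and ceiling so that no rounding error can occur.

```python
import random


def division_bounds(n, k):
    """Return (floor(n/k), ceil(n/k)) using exact integer arithmetic."""
    return n // k, -(-n // k)


def check_random_placements(n, k, trials=1000, seed=0):
    """Place n objects into k boxes at random and test Fact 3.9 each time."""
    rng = random.Random(seed)
    low, high = division_bounds(n, k)
    for _ in range(trials):
        boxes = [0] * k
        for _ in range(n):
            boxes[rng.randrange(k)] += 1
        # Some box must hold at least `high`, and some box at most `low`.
        if not (max(boxes) >= high and min(boxes) <= low):
            return False
    return True


print(division_bounds(100, 12))          # (8, 9)
print(check_random_placements(100, 12))  # True
```

Random trials cannot prove the principle, but a single failure would disprove it, so it is reassuring that none occurs.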


Example 3.24 Pick six integers between 0 and 9 (inclusive). Show that 
two of them must add up to 9. 


For example, suppose you picked 0, 1, 3, 5, 7 and 8. Then 1+8 = 9. If you
picked 4, 5, 6, 7, 8, 9, then 4+5 = 9. The problem asks us to show that this
happens no matter how we pick the numbers. 


Solution: Pick six numbers between 0 and 9. Here’s why two of them sum 
to 9: Imagine five boxes, each marked with two numbers, as shown below. 
Each box is labeled so that the two numbers written on it sum to 9. 


[Figure: five boxes, labeled "0, 9", "1, 8", "2, 7", "3, 6" and "4, 5".]


For each number that was picked, put it in the box having that number 
written on it. For example, if we picked 7, it goes in the box labeled “2, 7.” 
(The number 2, if picked, would go in that box too.) In this way we place the 
six chosen numbers in five boxes. As there are more numbers than boxes, 
the pigeonhole principle says that some box has more than one (hence two) 
of the picked numbers in it. Those two numbers sum to 9. 


Notice that if we picked only five numbers from 0 to 9, then it’s possible 
that no two sum to 9: we could be unlucky and pick 0, 1, 2, 3, 4. But the 
pigeonhole principle ensures that if six are picked then two do sum to 9. 
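There are only 210 ways to pick six integers from 0 through 9, so the claim of Example 3.24 can also be confirmed exhaustively. The following sketch (with hypothetical names) does so, and also collects the five-number picks that avoid a pair summing to 9.

```python
from itertools import combinations


def has_pair_summing_to(numbers, target=9):
    """True if two of the given distinct numbers add up to the target."""
    return any(a + b == target for a, b in combinations(numbers, 2))


# Every choice of six distinct integers from 0..9 contains such a pair.
all_six = all(has_pair_summing_to(c) for c in combinations(range(10), 6))

# With only five numbers the conclusion can fail.
five_without_pair = [c for c in combinations(range(10), 5)
                     if not has_pair_summing_to(c)]

print(all_six)                 # True
print(len(five_without_pair))  # 32
```

The 32 exceptional picks are exactly those that take one number from each of the five boxes, which is 2·2·2·2·2 = 32 choices.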




Example 3.25 A store has a gumball machine containing a large number 
of red, green, blue and white gumballs. You get one gumball for each nickel 
you put into the machine. The store offers the following deal: You agree to 
buy some number of gumballs, and if 13 or more of them have the same 
color you get $5. What is the fewest number of gumballs you need to buy to 
be 100% certain that you will make money on the deal? 


Solution: Let n be the number of gumballs that you buy. Imagine sorting 
your n gumballs into four boxes labeled RED, GREEN, BLUE, and WHITE. 
(That is, red balls go in the red box, green balls go in the green box, etc.) 


[Figure: four boxes labeled RED, GREEN, BLUE and WHITE.]


The division principle says that one box contains ⌈n/4⌉ or more gumballs.
Provided ⌈n/4⌉ ≥ 13, you will know you have 13 gumballs of the same color.
This happens if n/4 > 12 (so the ceiling of n/4 rounds to a value larger than 12).
Therefore you need n > 4·12 = 48, so if n = 49 you know you have at least
⌈49/4⌉ = ⌈12.25⌉ = 13 gumballs of the same color.


Answer: Buy 49 gumballs for 49 nickels, which is $2.45. You get $5, and 
therefore have made $2.55. 


Note that if you bought just 48 gumballs, you might win, but there is a 
chance that you’d get 12 gumballs of each color and miss out on the $5. And 
if you bought more than 49, you’d still get the $5, but you would have spent 
more nickels. 


Explicitly mentioning the boxes in the above solution is not necessary.
Some people prefer to draw a conclusion based on averaging alone. They
might solve the problem by letting n be the number of gumballs bought,
so n = r+g+b+w, where r is the number of them that are red, g is the
number that are green, b is the number of blues and w is the number
of whites. Then the average number of gumballs of a particular color is
(r+g+b+w)/4 = n/4. We need this to be greater than 12 to ensure 13 of the
same color, and the smallest number that does the job is n = 49. This is still
the division principle, in a pure form.
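The search for the smallest n in Example 3.25 can be phrased as a short loop. This is a sketch only; the function name `fewest_purchases` and the cent-based bookkeeping are assumptions made for illustration.

```python
def fewest_purchases(colors, needed):
    """Smallest n for which ceil(n / colors) >= needed."""
    n = 1
    while -(-n // colors) < needed:  # -(-n // colors) is ceil(n / colors)
        n += 1
    return n


n = fewest_purchases(colors=4, needed=13)
cost_cents = 5 * n            # one nickel per gumball
profit_cents = 500 - cost_cents
print(n, cost_cents, profit_cents)  # 49 245 255
```

The loop always stops at colors·(needed − 1) + 1, which here is 4·12 + 1 = 49.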




Example 3.26 Nine points are randomly placed on the right triangle
shown below. Show that three of these points form a triangle whose area is
1/8 square unit or less. (We allow triangles with zero area, in which case the
three points lie on a line.)


[Figure: a right triangle with legs of length 1.]


Solution: Divide the triangle into four smaller triangles, as indicated
by the dashed lines below. Each of these four triangles has an area of
$\frac{1}{2}bh = \frac{1}{2}\cdot\frac{1}{2}\cdot\frac{1}{2} = \frac{1}{8}$ square units. Think of these smaller
triangles as "boxes." So we have placed 9 points in 4
boxes. (If one of the 9 points happens to be on a
dashed line, say it belongs to the box below or to its
left.) The division principle says one of the boxes
has at least ⌈9/4⌉ = 3 of the points in it. Those three
points form a triangle whose area is no larger than
the area of the "box" that it is in. Thus these three
points form a triangle whose area is 1/8 or less.

[Figure: the same triangle divided by dashed lines into four smaller triangles.]


Exercises for Section 3.9 


1. Show that if six integers are chosen at random, then at least two of them will 
have the same remainder when divided by 5. 


2. You deal a pile of cards, face down, from a standard 52-card deck. What is 
the least number of cards the pile must have before you can be assured that it 
contains at least five cards of the same suit? 


3. What is the fewest number of times you must roll a six-sided dice before you can 
be assured that 10 or more of the rolls resulted in the same number? 


4. Select any five points on a square whose side-length is one unit. Show that at
least two of these points are within √2/2 units of each other.

5. Prove that any set of seven distinct natural numbers contains a pair of numbers 
whose sum or difference is a multiple of 10. 

6. Given a sphere S, a great circle of S is the intersection of S with a plane through
its center. Every great circle divides S into two parts. A hemisphere is the union
of the great circle and one of these two parts. Show that if five points are placed
arbitrarily on S, then there is a hemisphere that contains four of them.




3.10 Combinatorial Proof 


Combinatorial proof is a method of proving two different expressions are
equal by showing that they are both answers to the same counting question.
We have already used combinatorial proof (without calling it combinatorial
proof) in proving Pascal's formula $\binom{n+1}{k} = \binom{n}{k-1} + \binom{n}{k}$ on page 90.

There we argued that the left-hand side $\binom{n+1}{k}$ is, by definition, the number
of k-element subsets of the set S = {0,1,2,...,n} with |S| = n+1. But the
right-hand side also gives the number of k-element subsets of S, because such a
subset either contains 0 or it does not. We can make any k-element subset of
S that contains 0 by starting with 0 and selecting k−1 other elements from
{1,2,...,n}, in $\binom{n}{k-1}$ ways. We can make any k-element subset that does not
contain 0 by selecting k elements from {1,2,...,n}, and there are $\binom{n}{k}$ ways to
do this. Thus,

$$\underbrace{\binom{n+1}{k}}_{\substack{\text{number of $k$-element}\\ \text{subsets of } S=\{0,1,\dots,n\}}}
= \underbrace{\binom{n}{k-1}}_{\substack{\text{number of $k$-element}\\ \text{subsets of $S$ with 0}}}
+ \underbrace{\binom{n}{k}}_{\substack{\text{number of $k$-element}\\ \text{subsets of $S$ without 0}}}$$

Both sides count the number of k-element subsets of S, so they are equal.
This is combinatorial proof.


Example 3.27 Use combinatorial proof to show $\binom{n}{k} = \binom{n}{n-k}$.


Solution: First, by definition, if k < 0 or k > n, then both sides are 0, and
thus equal. Therefore for the rest of the proof we can assume 0 ≤ k ≤ n.
The left-hand side $\binom{n}{k}$ is the number of k-element subsets of S = {1,2,...,n}.
Every k-element subset X ⊆ S pairs with a unique (n−k)-element subset
$\overline{X}$ = S − X ⊆ S. Thus the number of k-element subsets of S equals the number
of (n−k)-element subsets of S, which is to say $\binom{n}{k} = \binom{n}{n-k}$.


We could also derive $\binom{n}{k} = \binom{n}{n-k}$ by using the formula for $\binom{n}{k}$ and quickly
get

$$\binom{n}{n-k} = \frac{n!}{(n-k)!\,(n-(n-k))!} = \frac{n!}{(n-k)!\,k!} = \frac{n!}{k!\,(n-k)!} = \binom{n}{k}.$$


But you may feel that the combinatorial proof is “slicker” because it uses 
the meanings of the terms. Often it is flat-out easier than using formulas, 
as in the next example. 




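Our next example will prove that $\sum_{k=0}^{n}\binom{n}{k}^2 = \binom{2n}{n}$, for any positive integer n,
which is to say that $\binom{n}{0}^2 + \binom{n}{1}^2 + \binom{n}{2}^2 + \cdots + \binom{n}{n}^2 = \binom{2n}{n}$. For example, if n = 5, this
statement asserts $\binom{5}{0}^2 + \binom{5}{1}^2 + \binom{5}{2}^2 + \binom{5}{3}^2 + \binom{5}{4}^2 + \binom{5}{5}^2 = \binom{2\cdot 5}{5}$, that is,

$$1^2 + 5^2 + 10^2 + 10^2 + 5^2 + 1^2 = \binom{10}{5},$$

which is true, as both sides equal 252. In general, the statement says that
the squares of the entries in the nth row of Pascal's triangle add up to $\binom{2n}{n}$.


Example 3.28 Use a combinatorial proof to show that $\sum_{k=0}^{n}\binom{n}{k}^2 = \binom{2n}{n}$.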

Solution: First, the right-hand side (*”) is the number of ways to select n 

things from a set S that has 2n elements. 

Now let’s count this a different way. Divide S into two equal-sized parts, 
S=AUB, where |A|=n and |B|=n, and ANB=@. 

For any fixed k with 0<k <n, we can select n things from S by taking k 
things from A and n-k things from B for a total of k+(n—k)=n things. By 
the multiplication principle, we get (;)(,,”,,) n-element subsets of S this way. 

As k could be any number from 0 to n, the number of ways to select n 
things from S is thus 


$$\binom{n}{0}\binom{n}{n-0} + \binom{n}{1}\binom{n}{n-1} + \binom{n}{2}\binom{n}{n-2} + \cdots + \binom{n}{n}\binom{n}{n-n}.$$


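But because $\binom{n}{n-k} = \binom{n}{k}$, this expression equals $\binom{n}{0}\binom{n}{0} + \binom{n}{1}\binom{n}{1} + \binom{n}{2}\binom{n}{2} + \cdots + \binom{n}{n}\binom{n}{n}$,
which is $\binom{n}{0}^2 + \binom{n}{1}^2 + \binom{n}{2}^2 + \cdots + \binom{n}{n}^2 = \sum_{k=0}^{n} \binom{n}{k}^2$.
In summary, we’ve counted the ways to choose n elements from the set
S with two methods. One method gives $\binom{2n}{n}$, and the other gives $\sum_{k=0}^{n} \binom{n}{k}^2$.
Therefore $\sum_{k=0}^{n} \binom{n}{k}^2 = \binom{2n}{n}$.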
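The two ways of counting in Example 3.28 can be imitated by brute force for a small n. This is a hedged sketch only: the function name `count_by_split` and the integer labels chosen for A and B are assumptions made for illustration.

```python
from itertools import combinations
from math import comb

def count_by_split(n):
    """Count the n-element subsets of S = A ∪ B, grouped by how many
    elements each subset takes from A (where |A| = |B| = n)."""
    A = set(range(n))              # hypothetical labels 0, ..., n-1
    B = set(range(n, 2 * n))       # hypothetical labels n, ..., 2n-1
    counts = [0] * (n + 1)
    for X in combinations(sorted(A | B), n):
        k = len(A.intersection(X))     # k elements from A, n-k from B
        counts[k] += 1
    return counts

n = 5
counts = count_by_split(n)
print(counts)                                              # one entry per k = 0, ..., n
print(sum(counts) == comb(2 * n, n))                       # first way of counting
print(counts == [comb(n, k) ** 2 for k in range(n + 1)])   # second way of counting
```

Each entry of `counts` corresponds to one term of the sum, so the total of the list is the left-hand side and the number of subsets enumerated is the right-hand side.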




Be on the lookout for opportunities to use combinatorial proof, and watch 
for it in your readings outside of this course. Also, try some of the exercises 
below. Sometimes it takes some creative thinking and false starts before 
you hit on an idea that works, but once you find it the solution is usually 
remarkably simple. 


Exercises for Section 3.10 

Use combinatorial proof to solve the following problems. You may assume that any 
variables m,n,k and p are non-negative integers. 

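1. Show that $1(n-0) + 2(n-1) + 3(n-2) + 4(n-3) + \cdots + (n-1)2 + (n-0)1 = \binom{n+2}{3}$.

2. Show that $1 + 2 + 3 + \cdots + n = \binom{n+1}{2}$.

3. Show that $\binom{n}{2}\binom{n-2}{k-2} = \binom{n}{k}\binom{k}{2}$.

4. Show that $P(n,k) = P(n-1,k) + k \cdot P(n-1,k-1)$.

5. Show that $\binom{2n}{2} = 2\binom{n}{2} + n^2$.

6. Show that $\binom{3n}{3} = 3\binom{n}{3} + 6n\binom{n}{2} + n^3$.

7. Show that $\sum_{k=0}^{p} \binom{m}{k}\binom{n}{p-k} = \binom{m+n}{p}$.

8. Show that $\sum_{k=0}^{m} \binom{m}{k}\binom{n}{p+k} = \binom{m+n}{m+p}$.

9. Show that $\sum_{k=m}^{n} \binom{k}{m} = \binom{n+1}{m+1}$.

10. Show that $\sum_{k=1}^{n} k\binom{n}{k} = n2^{n-1}$.

11. Show that $\sum_{k=0}^{n} 2^k\binom{n}{k} = 3^n$.

12. Show that $\sum_{k=0}^{n} \binom{n}{k}\binom{k}{m} = \binom{n}{m}2^{n-m}$.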




Part II


How to Prove Conditional 
Statements 


CHAPTER 4 


Direct Proof 


It is time to prove some theorems. There are various strategies for doing
this; we now examine the most straightforward approach, a technique
called direct proof. As we begin, it is important to keep in mind the meanings
of three key terms: theorem, proof and definition. 

A theorem is a mathematical statement that is true and can be (and 
has been) verified as true. A proof of a theorem is a written verification 
that shows that the theorem is definitely and unequivocally true. A proof 
should be understandable and convincing to anyone who has the requisite 
background and knowledge. This knowledge includes an understanding of 
the meanings of the mathematical words, phrases and symbols that occur 
in the theorem and its proof. It is crucial that both the writer of the proof 
and the readers of the proof agree on the exact meanings of all the words, 
for otherwise there is an intolerable level of ambiguity. A definition is an 
exact, unambiguous explanation of the meaning of a mathematical word or 
phrase. We will elaborate on the terms theorem and definition in the next 
two sections, and then finally we will be ready to begin writing proofs. 


4.1 Theorems 


A theorem is a statement that is true and has been proved to be true. You 
have encountered many theorems in your mathematical education. Here 
are some theorems taken from an undergraduate calculus text. They will 
be familiar to you, though you may not have read all the proofs. 


Theorem Let f be differentiable on an open interval I and let c ∈ I.
If f(c) is the maximum or minimum value of f on I, then f′(c) = 0.


Theorem If $\sum_{k=1}^{\infty} a_k$ converges, then $\lim_{k \to \infty} a_k = 0$.


Theorem Suppose f is continuous on the interval [a,b]. Then f is 
integrable on [a,b]. 


Theorem Every absolutely convergent series converges. 




Observe that each of these theorems either has the conditional form “If
P, then Q,” or can be put into that form. The first theorem has an initial
sentence “Let f be differentiable on an open interval I, and let c ∈ I,” which
sets up some notation, but a conditional statement follows it. The third 
theorem has form “Suppose P. Then Q,” but this means the same thing 
as “If P, then Q.” The last theorem can be re-expressed as “If a series is 
absolutely convergent, then it is convergent.” 

For another example, the equation $\sum_{k=0}^{n} \binom{n}{k}^2 = \binom{2n}{n}$ from Example 3.28
(page 109) is best phrased as a conditional statement, to make clear the 
assumption that n is an integer. 


Theorem If n is a non-negative integer, then $\sum_{k=0}^{n} \binom{n}{k}^2 = \binom{2n}{n}$.


A theorem of the form “If P, then Q,” can be regarded as a device that 
produces new information from P. Whenever we are dealing with a situation 
in which P is true, then the theorem guarantees that, in addition, Q is true. 
Since this kind of expansion of information is useful, theorems of the form 
“If P, then Q,” are very common. 

But not every theorem is a conditional statement. Some have the form 
of the biconditional P ⇔ Q, but, as we know, that can be expressed as two
conditional statements. Other theorems simply state facts about specific 
things. For example, here is another theorem from your study of calculus. 


Theorem The harmonic series $1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \frac{1}{5} + \cdots$ diverges.

It would be difficult (or at least awkward) to restate this as a conditional 
statement. Still, it is true that most theorems are conditional statements, 
so much of this book will concentrate on that type of theorem. 

It is important to be aware that there are a number of words that mean 
essentially the same thing as the word “theorem,” but are used in slightly 
different ways. In general the word “theorem” is reserved for a statement 
that is considered important or significant (the Pythagorean theorem, for 
example). A statement that is true but not as significant is sometimes called 
a proposition. A lemma is a theorem whose main purpose is to help prove 
another theorem. A corollary is a result that is an immediate consequence 
of a theorem or proposition. It is not important that you remember all these 
words now, for their meanings will become clear with usage. 

Our main task is to learn how to prove theorems. As the above examples 
suggest, proving theorems requires a clear understanding of the conditional 
statement, and that is the main reason we studied it so extensively in 
Chapter 2. It is also crucial to understand the role of definitions. 




4.2 Definitions 


A proof of a theorem should be absolutely convincing. Ambiguity must be 
avoided. Everyone must agree on the exact meaning of each mathematical 
term. In Chapter 1 we defined the meanings of the sets N, Z, R, Q and ∅, as
well as the meanings of the symbols ∈ and ⊆, and we shall make frequent
use of these things. Here is another definition that we use often. 


Definition 4.1 An integer n is even if n = 2a for some integer a ∈ Z.


Thus, for example, 10 is even because 10 = 2·5. Also, according to the
definition, 7 is not even because there is no integer a for which 7 = 2a. While
there would be nothing wrong with defining an integer to be odd if it’s not 
even, the following definition is more concrete. 


Definition 4.2 An integer n is odd if n = 2a+1 for some integer a ∈ Z.


Thus 7 is odd because 7 = 2·3+1. We will use these definitions whenever
the concept of even or odd numbers arises. If in a proof a certain number is
even, the definition allows us to write it as 2a for an appropriate integer a.
If some quantity has form 2b+1 where b is an integer, then the definition
tells us the quantity is odd. 


Definition 4.3 Two integers have the same parity if they are both 
even or they are both odd. Otherwise they have opposite parity. 


Thus 5 and —17 have the same parity, as do 8 and 0; but 3 and 4 have 
opposite parity. 

Two points about definitions are in order. First, in this book the word 
or term being defined appears in boldface type. Second, it is common to 
express definitions as conditional statements even though the biconditional 
would more appropriately convey the meaning. Consider the definition of an 
even integer. You understand full well that if n is even then n = 2a (for a ∈ Z),
and if n = 2a, then n is even. Thus, technically the definition should read
“An integer n is even if and only if n = 2a for some a ∈ Z.” However, it is an
almost-universal convention that definitions are phrased in the conditional 
form, even though they are interpreted as being in the biconditional form. 
There is really no good reason for this, other than economy of words. It is 
the standard way of writing definitions, and we have to get used to it. 

Here is another definition that we will use often. 




Definition 4.4 Suppose a and b are integers. We say that a divides b,
written a | b, if b = ac for some c ∈ Z. In this case we also say that a is a
divisor of b, and that b is a multiple of a.


For example, 5 divides 15 because 15 = 5·3. We write this as 5 | 15.
Similarly 8 | 32 because 32 = 8·4, and −6 | 6 because 6 = (−6)·(−1). However, 6
does not divide 9 because there is no integer c for which 9 = 6·c. We express
this as 6 ∤ 9, which we read as “6 does not divide 9.”

Be careful of your interpretation of the symbols. There is a big difference 
between the expressions a | b and a/b. The expression a | b is a statement,
while a/b is a fraction. For example, 8 | 16 is true and 8 | 20 is false. By
contrast, 8/16 = 0.5 and 8/20 = 0.4 are numbers, not statements. Be careful 
not to write one when you mean the other. 
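The difference between the statement a | b and the number a/b shows up clearly in code. Below is a small sketch in Python; the function name `divides` is a hypothetical helper that follows Definition 4.4, not anything defined in the text.

```python
def divides(a: int, b: int) -> bool:
    """Return True when a | b, that is, when b = a*c for some integer c."""
    if a == 0:
        return b == 0      # b = 0*c is possible only when b = 0
    return b % a == 0

# a | b is a statement: it is either True or False.
print(divides(8, 16), divides(8, 20))
# a / b is a number.
print(8 / 16, 8 / 20)
```

The first line of output is a pair of truth values and the second is a pair of numbers, mirroring the distinction drawn above.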

Every integer has a set of integers that divide it. For example, the set of 
divisors of 6 is {a ∈ Z : a | 6} = {−6,−3,−2,−1,1,2,3,6}. The set of divisors of 5
is {−5,−1,1,5}. The set of divisors of 0 is Z. This brings us to the following
definition, with which you are already familiar. 


Definition 4.5 A number n ∈ N is prime if it has exactly two positive
divisors, 1 and n. If n has more than two positive divisors, it is called
composite. (Thus n is composite if and only if n = ab for 1 < a, b < n.)


For example, 2 is prime, as are 5 and 17. The definition implies that 1 is 
neither prime nor composite, as it only has one positive divisor, namely 1. 
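Definition 4.5 translates almost word for word into a count of positive divisors. The sketch below is illustrative only; `positive_divisors` and `classify` are names invented here, and the brute-force search is chosen for clarity rather than speed.

```python
def positive_divisors(n):
    """List the positive divisors of a natural number n."""
    return [d for d in range(1, n + 1) if n % d == 0]

def classify(n):
    """Apply Definition 4.5 by counting positive divisors."""
    count = len(positive_divisors(n))
    if count == 2:
        return "prime"
    if count > 2:
        return "composite"
    return "neither"       # only n = 1 has a single positive divisor

for n in (1, 2, 5, 6, 17):
    print(n, positive_divisors(n), classify(n))
```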


Definition 4.6 The greatest common divisor of integers a and b,
denoted gcd(a,b), is the largest integer that divides both a and b.

The least common multiple of non-zero integers a and b, denoted
lcm(a,b), is the smallest integer in N that is a multiple of both a and b.


So gcd(18,24) = 6, gcd(5,5) = 5 and gcd(32,−8) = 8. Also gcd(50,18) = 2,
but gcd(50,9) = 1. Note that gcd(0,6) = 6, because, although every integer
divides 0, the largest divisor of 6 is 6.

The expression gcd(0,0) is problematic. Every integer divides 0, so the
only conclusion is that gcd(0,0) = ∞. We circumvent this irregularity by
simply agreeing to consider gcd(a,b) only when a and b are not both zero.

Continuing our examples, lcm(4,6) = 12, and lcm(7,7) = 7.
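Definition 4.6 can be checked directly by searching for the largest common divisor and the smallest common multiple. This is a rough sketch under the definition as stated; the names `gcd_by_definition` and `lcm_by_definition` are hypothetical, and the search is deliberately naive. Python's built-in `math.gcd` is used only as a cross-check (it returns 0 for `math.gcd(0, 0)`, a different convention from the one adopted above).

```python
import math

def gcd_by_definition(a, b):
    """Largest integer dividing both a and b (a and b not both zero)."""
    if a == 0 and b == 0:
        raise ValueError("gcd(0, 0) is not considered here")
    bound = max(abs(a), abs(b))
    return max(d for d in range(1, bound + 1) if a % d == 0 and b % d == 0)

def lcm_by_definition(a, b):
    """Smallest natural number that is a multiple of both non-zero a and b."""
    if a == 0 or b == 0:
        raise ValueError("lcm is defined only for non-zero integers")
    m = 1
    while m % a != 0 or m % b != 0:
        m += 1
    return m

for a, b in [(18, 24), (5, 5), (32, -8), (50, 18), (50, 9), (0, 6)]:
    # The definition-based search agrees with the library routine.
    assert gcd_by_definition(a, b) == math.gcd(a, b)
    print(a, b, gcd_by_definition(a, b))

print(lcm_by_definition(4, 6), lcm_by_definition(7, 7))
```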

Of course not all terms can be defined. If every word in a definition were
defined, there would be separate definitions for the words that appeared
in those definitions, and so on, until the chain of defined terms became
circular. Thus we accept some ideas as being so intuitively clear that they 
require no definitions or verifications. For example, we will not find it 
necessary to define what an integer (or a real number) is. Nor will we define 
addition, multiplication, subtraction and division, though we will use these 
operations freely. We accept and use such things as the distributive and 
commutative properties of addition and multiplication, as well as other 
standard properties of arithmetic and algebra. 

As mentioned in Section 1.9, we accept as fact the natural ordering of the 
elements of N,Z,Q and R, so that (for example) statements such as “5 < 7,” 
and “x < y implies −x > −y,” do not need to be justified.

In addition, we accept the following fact without justification or proof. 


Fact 4.1 If a and b are integers, then so are their sum, product and
difference. That is, if a, b ∈ Z, then a+b ∈ Z, a−b ∈ Z and ab ∈ Z.


It follows that any combination of integers using the operations +, − and ·
is an integer. For example, if a, b and c are integers, then a²b − ca + b ∈ Z.

We will also accept as obvious the fact that any integer a can be divided 
by a positive integer b, resulting in a unique quotient q and remainder r.
For example, b = 3 goes into a = 17 q = 5 times with remainder r = 2. In
symbols, 17 = 5·3+2, or a = qb+r. This fact, called the division algorithm,
was mentioned on page 30. 


(The Division Algorithm) Given integers a and b with b > 0, there exist
unique integers q and r for which a = qb+r and 0 ≤ r < b.
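For readers who like to experiment, Python's built-in `divmod` produces a quotient and remainder of exactly this kind when b is positive. The wrapper name `divide` below is a hypothetical helper, and the internal assertion simply restates the two conditions a = qb + r and 0 ≤ r < b.

```python
def divide(a, b):
    """Return (q, r) with a = q*b + r and 0 <= r < b, for b > 0."""
    if b <= 0:
        raise ValueError("the division algorithm is stated here for b > 0")
    q, r = divmod(a, b)                    # floor division and its remainder
    assert a == q * b + r and 0 <= r < b   # the two defining conditions
    return q, r

print(divide(17, 3))     # the example from the text: q = 5, r = 2
print(divide(-17, 3))    # a may be negative; the remainder is still in 0..b-1
```

Note that for negative a the quotient rounds down, which is what keeps the remainder non-negative.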


Another fact that we will accept without proof (at least for now) is 
that every natural number greater than 1 has a unique factorization into 
primes. For example, the number 1176 can be factored into primes as 
1176 = 2·2·2·3·7·7 = 2³·3·7². By unique we mean that any factorization
of 1176 into primes will have exactly the same factors (i.e., three 2’s, one 3 
and two 7’s). Thus, for example, there is no valid factorization of 1176 that 
has a factor of 5. You may be so used to factoring numbers into primes that 
it seems obvious that there cannot be different prime factorizations of the 
same number, but in fact this is a fundamental result whose proof is not 
transparent. Nonetheless, we will be content to assume that every natural 
number greater than 1 has a unique factorization into primes. (We will 
revisit the issue of a proof in Section 10.3.) 
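A prime factorization such as the one for 1176 can be found by repeated trial division. The following is a minimal sketch, with `prime_factors` an invented name; it computes one factorization and says nothing about why that factorization is unique, which is the part that requires proof.

```python
def prime_factors(n):
    """Return the prime factors of n > 1 in non-decreasing order."""
    if n < 2:
        raise ValueError("n must be a natural number greater than 1")
    factors = []
    p = 2
    while p * p <= n:
        while n % p == 0:      # divide out p as many times as possible
            factors.append(p)
            n //= p
        p += 1
    if n > 1:                  # whatever remains is itself prime
        factors.append(n)
    return factors

print(prime_factors(1176))
```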

We will introduce other accepted facts, as well as definitions, as needed. 




4.3 Direct Proof 


This section explains a simple way to prove theorems or propositions that 
have the form of conditional statements. The technique is called direct 
proof. To simplify the discussion, our first examples will involve proving 
statements that are almost obviously true. Thus we will call the statements 
propositions rather than theorems. (Remember, a proposition is a statement 
that, although true, is not as significant as a theorem.) 

To understand how the technique of direct proof works, suppose we have 
some proposition of the following form. 


Proposition If P, then Q. 


This proposition is a conditional statement of form P ⇒ Q. Our goal is
to show that this conditional statement is true. To see how to proceed, look
at the truth table for P ⇒ Q.


P | Q | P ⇒ Q
T | T |   T
T | F |   F
F | T |   T
F | F |   T


The table shows that if P is false, the statement P ⇒ Q is automatically
true. This means that if we are concerned with showing P ⇒ Q is true, we
don’t have to worry about the situations where P is false (as in the last two
lines of the table) because the statement P ⇒ Q will be automatically true
in those cases. But we must be very careful about the situations where P is
true (as in the first two lines of the table). We must show that the condition
of P being true forces Q to be true also, for that means the second line of
the table cannot happen.

This gives a fundamental outline for proving statements of the form
P ⇒ Q. Begin by assuming that P is true (remember, we don’t need to worry
about P being false) and show this forces Q to be true.


Outline for Direct Proof 
Proposition If P, then Q. 


Proof. Suppose P. 


Therefore Q. ∎




So the setup for direct proof is remarkably simple. The first line of the 
proof is the sentence “Suppose P.” The last line is the sentence “Therefore Q.” 
Between the first and last line we use logic, definitions and standard math 
facts to transform the statement P to the statement Q. It is common to use 
the word “Proof” to indicate the beginning of a proof, and the symbol ∎ to
indicate the end. 

As our first example, let’s prove that if x is odd then x² is also odd.
(Granted, this is not a terribly impressive result, but we will move on to 
more significant things in due time.) The first step in the proof is to fill in 
the outline for direct proof. This is a lot like painting a picture, where the 
basic structure is sketched in first. We leave some space between the first 
and last line of the proof. The following series of frames indicates the steps 
you might take to fill in this space with a logical chain of reasoning. 


Proposition If x is odd, then x² is odd.


Proof. Suppose x is odd.


Therefore x² is odd. ∎


Now that we have written the first and last lines, we need to fill in the space 
with a chain of reasoning that shows that x being odd forces x² to be odd.

In doing this it’s always advisable to use any definitions that apply. The
first line says x is odd, and by Definition 4.2 it must be that x = 2a+1 for
some a ∈ Z, so we write this in as our second line.


Proposition If x is odd, then x² is odd.


Proof. Suppose x is odd.
Then x = 2a+1 for some a ∈ Z, by definition of an odd number.


Therefore x² is odd. ∎


Now jump down to the last line, which says x² is odd. Think about what
the line immediately above it would have to be in order for us to conclude
that x² is odd. By the definition of an odd number, we would have to have
x² = 2a+1 for some a ∈ Z. However, the symbol a now appears earlier in the
proof in a different context, so we should use a different symbol, say b.




Proposition If x is odd, then x² is odd.


Proof. Suppose x is odd.
Then x = 2a+1 for some a ∈ Z, by definition of an odd number.


Thus x² = 2b+1 for an integer b.
Therefore x² is odd, by definition of an odd number. ∎


We are almost there. We can bridge the gap as follows. 


Proposition If x is odd, then x² is odd.


Proof. Suppose x is odd.
Then x = 2a+1 for some a ∈ Z, by definition of an odd number.
Thus x² = (2a+1)² = 4a²+4a+1 = 2(2a²+2a)+1.
So x² = 2b+1 where b is the integer b = 2a²+2a.
Thus x² = 2b+1 for an integer b.
Therefore x² is odd, by definition of an odd number. ∎


Finally, we may wish to clean up our work and write the proof in paragraph 
form. Here is our final version. 


Proposition If x is odd, then x² is odd.


Proof. Suppose x is odd. Then x = 2a+1 for some a ∈ Z, by definition
of an odd number. Thus x² = (2a+1)² = 4a²+4a+1 = 2(2a²+2a)+1, so
x² = 2b+1 where b = 2a²+2a ∈ Z. Therefore x² is odd, by definition
of an odd number. ∎
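The proof does more than assert that x² is odd: it names the integer b = 2a² + 2a that witnesses this. A short script can compute that witness for a few odd values. This is a sketch for illustration (the name `odd_square_witness` is invented), and checking finitely many cases is of course no substitute for the proof.

```python
def odd_square_witness(x):
    """For odd x, return (a, b) with x = 2a + 1 and x*x = 2b + 1,
    using the formula b = 2a^2 + 2a from the proof."""
    if x % 2 == 0:
        raise ValueError("x must be odd")
    a = (x - 1) // 2
    b = 2 * a * a + 2 * a
    return a, b

for x in range(-7, 8, 2):          # the odd integers -7, -5, ..., 7
    a, b = odd_square_witness(x)
    assert x == 2 * a + 1          # x is odd, with witness a
    assert x * x == 2 * b + 1      # x^2 is odd, with witness b
    print(x, a, b)
```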


At least initially, it’s generally a good idea to write the first and last line 
of your proof first, and then fill in the gap, sometimes jumping alternately 
between top and bottom until you meet in the middle, as we did above. This 
way you are constantly reminded that you are aiming for the statement 
at the bottom. Sometimes you will leave too much space, sometimes not 
enough. Sometimes you will get stuck before figuring out what to do. This 
is normal. Mathematicians do scratch work just as artists do sketches for 
their paintings. 




Here is another example. Consider proving the following proposition. 

Proposition Let a, b and c be integers. If a | b and b | c, then a | c.


Let’s apply the basic outline for direct proof. To clarify the procedure we 
will write the proof in stages again. 


Proposition Let a, b and c be integers. If a | b and b | c, then a | c.


Proof. Suppose a | b and b | c.


Therefore a | c. ∎


Our first step is to apply Definition 4.4 to the first line. The definition
says a | b means b = ac for some integer c, but since c already appears in
a different context on the first line, we must use a different letter, say d.
Similarly let’s use a new letter e in the definition of b | c.


Proposition Let a, b and c be integers. If a | b and b | c, then a | c.


Proof. Suppose a | b and b | c.
By Definition 4.4, we know a | b means b = ad for some d ∈ Z.
Likewise, b | c means c = be for some e ∈ Z.


Therefore a | c. ∎


We have almost bridged the gap. The line immediately above the last line 
should show that a|c. According to Definition 4.4, this line should say that 
c=ax for some integer x. We can get this equation from the lines at the top, 
as follows. 


Proposition Let a, b and c be integers. If a | b and b | c, then a | c.


Proof. Suppose a | b and b | c.
By Definition 4.4, we know a | b means b = ad for some d ∈ Z.
Likewise, b | c means c = be for some e ∈ Z.
Thus c = be = (ad)e = a(de), so c = ax for the integer x = de.
Therefore a | c. ∎


The next example is presented all at once rather than in stages. 




Proposition If x is an even integer, then x² − 6x + 5 is odd.


Proof. Suppose x is an even integer.
Then x = 2a for some a ∈ Z, by definition of an even integer.
So x²−6x+5 = (2a)²−6(2a)+5 = 4a²−12a+5 = 4a²−12a+4+1 = 2(2a²−6a+2)+1.
Therefore we have x²−6x+5 = 2b+1, where b = 2a²−6a+2 ∈ Z.
Consequently x²−6x+5 is odd, by definition of an odd number. ∎


One doesn’t normally use a separate line for each sentence in a proof, 
but for clarity we will often do this in the first few chapters of this book. 

Our next example illustrates a standard technique for showing two 
quantities are equal. If we can show m ≤ n and n ≤ m then it follows that
m = n. In general, the reasoning involved in showing m ≤ n can be quite
different from that of showing n ≤ m.

Recall Definition 4.6 of a least common multiple on page 116. 


Proposition If a, b, c ∈ N, then lcm(ca,cb) = c·lcm(a,b).


Proof. Assume a, b, c ∈ N. Let m = lcm(ca,cb) and n = c·lcm(a,b). We will
show m = n. By definition, lcm(a,b) is a positive multiple of both a and b, so
lcm(a,b) = ax = by for some x, y ∈ N. From this we see that n = c·lcm(a,b) =
cax = cby is a positive multiple of both ca and cb. But m = lcm(ca,cb) is the
smallest positive multiple of both ca and cb. Thus m ≤ n.

On the other hand, as m = lcm(ca,cb) is a multiple of both ca and cb, we
have m = cax = cby for some x, y ∈ Z. Then m/c = ax = by is a multiple of both
a and b. Therefore lcm(a,b) ≤ m/c, so c·lcm(a,b) ≤ m, that is, n ≤ m.

We’ve shown m ≤ n and n ≤ m, so m = n. The proof is complete. ∎
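Before (or after) working through a proof like this one, it can be reassuring to test the claim numerically. The sketch below searches for least common multiples straight from Definition 4.6 and compares lcm(ca, cb) with c·lcm(a, b) over a small range; the function name `lcm` and the range 1 to 12 are arbitrary choices for illustration.

```python
def lcm(a, b):
    """Smallest positive multiple of a that is also a multiple of b
    (a and b are natural numbers)."""
    m = a
    while m % b != 0:
        m += a             # step through the positive multiples of a
    return m

# Compare both sides of the proposition for all small a, b, c.
for a in range(1, 13):
    for b in range(1, 13):
        for c in range(1, 13):
            assert lcm(c * a, c * b) == c * lcm(a, b)

print(lcm(4, 6), lcm(3 * 4, 3 * 6))
```

A search like this can expose a false conjecture quickly, but only the two-inequality argument above establishes the equality for every a, b and c.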


The examples we’ve looked at so far have all been proofs of statements 
about integers. In our next example, we are going to prove that if x and y 
are positive real numbers for which x ≤ y, then √x ≤ √y. You may feel that
the proof is not as “automatic” as the proofs we have done so far. Finding 
the right steps in a proof can be challenging, and that is part of the fun. 


Proposition Let x and y be positive numbers. If x ≤ y, then √x ≤ √y.


Proof. Suppose x ≤ y. Subtracting y from both sides gives x−y ≤ 0.
This can be written as (√x)² − (√y)² ≤ 0.
Factor this as a difference of two squares to get (√x − √y)(√x + √y) ≤ 0.
Dividing both sides by the positive number √x + √y produces √x − √y ≤ 0.
Adding √y to both sides gives √x ≤ √y. ∎




This proposition tells us that whenever x ≤ y, we can take the square
root of both sides and be assured that √x ≤ √y. This can be useful, as we
will see in our next proposition.

That proposition will concern the expression 2√(xy) ≤ x+y. Notice when
you substitute random positive values for the variables, the expression is
true. For example, for x = 6 and y = 4, the left side is 2√(6·4) = 4√6 ≈ 9.80,
which is less than the right side 6+4 = 10. Is it true that 2√(xy) ≤ x+y for
any positive x and y? How could we prove it?

To see how, let’s first cast this into the form of a conditional statement:
If x and y are positive real numbers, then 2√(xy) ≤ x+y. The proof begins
with the assumption that x and y are positive, and ends with 2√(xy) ≤ x+y.
In mapping out a strategy, it can be helpful to work backwards, working
from 2√(xy) ≤ x+y to something that is obviously true. Then the steps can be
reversed in the proof. In this case, squaring both sides of 2√(xy) ≤ x+y gives
us

4xy ≤ x² + 2xy + y².


Now subtract 4xy from both sides and factor:

0 ≤ x² − 2xy + y²
0 ≤ (x − y)².


But this last line is clearly true, since the square of x−y cannot be negative!
This gives us a strategy for the proof, which follows. 


Proposition If x and y are positive real numbers, then 2√(xy) ≤ x+y.


Proof. Suppose x and y are positive real numbers.
Observe that 0 ≤ (x−y)², that is, 0 ≤ x²−2xy+y².
Adding 4xy to both sides gives 4xy ≤ x²+2xy+y².
Factoring yields 4xy ≤ (x+y)².
Previously we proved that such an inequality still holds after taking the
square root of both sides; doing so produces 2√(xy) ≤ x+y. ∎


Notice that in the last step of the proof we took the square root of both
sides of 4xy ≤ (x+y)² and got √(4xy) ≤ √((x+y)²), and the fact that this did
not reverse the symbol ≤ followed from our previous proposition. This is an
important point. Often the proof of a proposition or theorem uses another 
proposition or theorem (that has already been proved). 
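The algebra behind this proof can be spot-checked numerically. In the sketch below, the helper `squared_gap` and the list of sample values are illustrative choices; exact arithmetic (integers and `Fraction`) is used for the identity (x+y)² − 4xy = (x−y)², and a small tolerance is allowed when the inequality itself is tested with floating-point square roots.

```python
import math
from fractions import Fraction

def squared_gap(x, y):
    """(x + y)^2 - 4xy, which the proof rewrites as (x - y)^2."""
    return (x + y) ** 2 - 4 * x * y

samples = [(6, 4), (1, 1), (10, 1), (Fraction(1, 3), Fraction(7, 2))]
for x, y in samples:
    # Exact check: the gap equals (x - y)^2 and is therefore non-negative.
    assert squared_gap(x, y) == (x - y) ** 2 >= 0
    # Floating-point check of 2*sqrt(xy) <= x + y, with a small tolerance.
    assert 2 * math.sqrt(x * y) <= x + y + 1e-12

print(squared_gap(6, 4), squared_gap(1, 1))
```

The gap is zero exactly when x = y, which matches the fact that 0 ≤ (x−y)² is an equality only in that case.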




4.4 Using Cases


In proving a statement is true, we sometimes have to examine multiple 
cases before showing the statement is true in all possible scenarios. This 
section illustrates a few examples. 

Our examples will concern the expression 1+(−1)ⁿ(2n−1). Here is a
table showing its value for various integers n. Notice that 1+(−1)ⁿ(2n−1)
is a multiple of 4 in every line.


n | 1+(−1)ⁿ(2n−1)
1 | 0
2 | 4
3 | −4
4 | 8
5 | −8
6 | 12


Is 1+(−1)ⁿ(2n−1) always a multiple of 4? We prove the answer is “yes”
in our next example. Notice, however, that the expression 1+(−1)ⁿ(2n−1)
behaves differently depending on whether n is even or odd, for in the first
case (−1)ⁿ = 1, and in the second (−1)ⁿ = −1. Thus the proof must examine
these two possibilities separately.


Proposition If n ∈ N, then 1+(−1)ⁿ(2n−1) is a multiple of 4.


Proof. Suppose n ∈ N.
Then n is either even or odd. Let’s consider these two cases separately.


Case 1. Suppose n is even. Then n = 2k for some k ∈ Z, and (−1)ⁿ = 1.
Thus 1+(−1)ⁿ(2n−1) = 1+(1)(2·2k−1) = 4k, which is a multiple of 4.


Case 2. Suppose n is odd. Then n = 2k+1 for some k ∈ Z, and (−1)ⁿ = −1.
Thus 1+(−1)ⁿ(2n−1) = 1−(2(2k+1)−1) = −4k, which is a multiple of 4.


These cases show that 1+(−1)ⁿ(2n−1) is always a multiple of 4. ∎


Now let’s examine the flip side of the question. We just proved that
1+(−1)ⁿ(2n−1) is always a multiple of 4, but can we get every multiple of 4
this way? The following proposition and proof give an affirmative answer.




Proposition Every multiple of 4 equals 1+(−1)ⁿ(2n−1) for some n ∈ N.


Proof. In conditional form, the proposition is as follows:
If k is a multiple of 4, then there is an n ∈ N for which 1+(−1)ⁿ(2n−1) = k.
What follows is a proof of this conditional statement.
Suppose k is a multiple of 4.
This means k = 4a for some integer a.
We must produce an n ∈ N for which 1+(−1)ⁿ(2n−1) = k.
This is done by cases, depending on whether a is zero, positive or negative.
Case 1. Suppose a = 0. Let n = 1. Then 1+(−1)ⁿ(2n−1) = 1+(−1)¹(2·1−1) = 0
= 4·0 = 4a = k.
Case 2. Suppose a > 0. Let n = 2a, which is in N because a is positive. Also
n is even, so (−1)ⁿ = 1. Thus 1+(−1)ⁿ(2n−1) = 1+(2n−1) = 2n = 2(2a) = 4a = k.
Case 3. Suppose a < 0. Let n = 1−2a, which is an element of N because
a is negative, making 1−2a positive. Also n is odd, so (−1)ⁿ = −1. Thus
1+(−1)ⁿ(2n−1) = 1−(2n−1) = 1−(2(1−2a)−1) = 4a = k.

The above cases show that no matter whether a multiple k = 4a of 4 is
zero, positive or negative, k = 1+(−1)ⁿ(2n−1) for some n ∈ N. ∎
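The three cases of this proof amount to a recipe for producing n from k, and a recipe can be run. The sketch below uses two invented names, `value` for the expression 1+(−1)ⁿ(2n−1) and `n_for_multiple` for the case analysis; it checks the recipe on a range of multiples of 4, which illustrates the proof without replacing it.

```python
def value(n):
    """The expression 1 + (-1)^n (2n - 1)."""
    return 1 + (-1) ** n * (2 * n - 1)

def n_for_multiple(k):
    """Given a multiple k = 4a of 4, return the natural number n chosen
    in the proof, so that value(n) == k."""
    if k % 4 != 0:
        raise ValueError("k must be a multiple of 4")
    a = k // 4
    if a == 0:
        return 1            # Case 1
    if a > 0:
        return 2 * a        # Case 2: n is even
    return 1 - 2 * a        # Case 3: n is odd and positive

for k in range(-40, 41, 4):
    n = n_for_multiple(k)
    assert n >= 1 and value(n) == k

print([value(n) for n in range(1, 7)])
```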


4.5 Treating Similar Cases 


Occasionally two or more cases in a proof will be so similar that writing 
them separately seems tedious or unnecessary. Here is an example: 


Proposition If two integers have opposite parity, then their sum is odd. 


Proof. Suppose m and n are two integers with opposite parity.
We need to show that m+n is odd. This is done in two cases, as follows.
Case 1. Suppose m is even and n is odd. Thus m = 2a and n = 2b+1 for
some integers a and b. Therefore m+n = 2a+2b+1 = 2(a+b)+1, which is
odd (by Definition 4.2).
Case 2. Suppose m is odd and n is even. Thus m = 2a+1 and n = 2b for
some integers a and b. Therefore m+n = 2a+1+2b = 2(a+b)+1, which is
odd (by Definition 4.2).

In either case, m+n is odd. ∎


The two cases in this proof are entirely alike except for the order in 
which the even and odd terms occur. It is entirely appropriate to just do 
one case and indicate that the other case is nearly identical. The phrase 
“Without loss of generality...” is a common way of signaling that the proof is 
treating just one of several nearly identical cases. Here is a second version 
of the above example. 




Proposition If two integers have opposite parity, then their sum is odd. 


Proof. Suppose m and n are two integers with opposite parity.
We need to show that m+n is odd.
Without loss of generality, suppose m is even and n is odd.
Thus m = 2a and n = 2b+1 for some integers a and b.
Therefore m+n = 2a+2b+1 = 2(a+b)+1, which is odd (by Definition 4.2). ∎


In reading proofs in other texts, you may sometimes see the phrase 
“Without loss of generality” abbreviated as “WLOG.” However, in the interest 
of transparency we will avoid writing it this way. In a similar spirit, it is 
advisable—at least until you become more experienced in proof writing— 
that you write out all cases, no matter how similar they appear to be. 

Please check your understanding by doing the following exercises. The 
odd numbered problems have complete proofs in the Solutions section in 
the back of the text. 


Exercises for Chapter 4 


Use the method of direct proof to prove the following statements. 
1. If x is an even integer, then x² is even.

2. If x is an odd integer, then x³ is odd.

3. If a is an odd integer, then a² + 3a + 5 is odd.

4. Suppose x, y ∈ Z. If x and y are odd, then xy is odd.

5. Suppose x, y ∈ Z. If x is even, then xy is even.

6. Suppose a, b, c ∈ Z. If a | b and a | c, then a | (b+c).

7. Suppose a, b ∈ Z. If a | b, then a² | b².

8. Suppose a is an integer. If 5 | 2a, then 5 | a.

9. Suppose a is an integer. If 7 | 4a, then 7 | a.

10. Suppose a and b are integers. If a | b, then a | (3b³ − b² + 5b).

11. Suppose a, b, c, d ∈ Z. If a | b and c | d, then ac | bd.

12. If x ∈ R and 0 < x < 4, then $\frac{4}{x(4-x)} \ge 1$.

13. Suppose x, y ∈ R. If x² + 5y = y² + 5x, then x = y or x + y = 5.

14. If n ∈ Z, then 5n² + 3n + 7 is odd. (Try cases.)

15. If n ∈ Z, then n² + 3n + 4 is even. (Try cases.)

16. If two integers have the same parity, then their sum is even. (Try cases.)

17. If two integers have opposite parity, then their product is even.

18. Suppose x and y are positive real numbers. If x < y, then x² < y².

19. Suppose a, b and c are integers. If a² | b and b³ | c, then a⁶ | c.

20. If a is an integer and a² | a, then a ∈ {−1, 0, 1}.

21. If p is prime and k is an integer for which 0 < k < p, then p divides $\binom{p}{k}$.

22. If n ∈ N, then $n^2 = 2\binom{n}{2} + \binom{n}{1}$. (You may need a separate case for n = 1.)

23. If n ∈ N, then $\binom{2n}{n}$ is even.

24. If n ∈ N and n ≥ 2, then the numbers n!+2, n!+3, n!+4, n!+5, ..., n!+n are all
composite. (Thus for any n ≥ 2, one can find n−1 consecutive composite numbers.
This means there are arbitrarily large “gaps” between prime numbers.)

25. If a, b, c ∈ N and c ≤ b ≤ a, then $\binom{a}{b}\binom{b}{c} = \binom{a}{b-c}\binom{a-b+c}{c}$.

26. Every odd integer is a difference of two squares. (Example 7 = 4² − 3², etc.)

27. Suppose a, b ∈ N. If gcd(a,b) > 1, then b | a or b is not prime.

28. Let a, b, c ∈ Z. Suppose a and b are not both zero, and c ≠ 0. Prove that c·gcd(a,b) ≤
gcd(ca,cb).




CHAPTER 5 


Contrapositive Proof 


We now examine an alternative to direct proof called contrapositive
proof. Like direct proof, the technique of contrapositive proof is
used to prove conditional statements of the form “If P, then Q.” Although 
it is possible to use direct proof exclusively, there are occasions where 
contrapositive proof is much easier. 


5.1 Contrapositive Proof 


To understand how contrapositive proof works, imagine that you need to 
prove a proposition of the following form. 


Proposition If P, then Q. 


This is a conditional statement of form P ⇒ Q. Our goal is to show that
this conditional statement is true. Recall that in Section 2.6 we observed
that P ⇒ Q is logically equivalent to ~Q ⇒ ~P. For convenience, we duplicate
the truth table that verifies this fact.


P | Q | ~Q | ~P | P ⇒ Q | ~Q ⇒ ~P
T | T | F  | F  |   T   |    T
T | F | T  | F  |   F   |    F
F | T | F  | T  |   T   |    T
F | F | T  | T  |   T   |    T


According to the table, statements P ⇒ Q and ~Q ⇒ ~P are different
ways of expressing exactly the same thing. The expression ~Q ⇒ ~P is
called the contrapositive form of P ⇒ Q. (Do not confuse the two words
contrapositive and converse. Recall from Section 2.4 that the converse of
P ⇒ Q is the statement Q ⇒ P, which is not logically equivalent to P ⇒ Q.)
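
The equivalence can also be checked mechanically. The following Python sketch is only an illustration (the helper name `implies` is our own choice, not notation from this book). It runs through all four truth assignments and confirms that a conditional always agrees with its contrapositive, but not always with its converse.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    """Truth value of the conditional statement p => q."""
    return (not p) or q

# Every truth assignment for the pair (P, Q).
rows = list(product([True, False], repeat=2))

# P => Q has the same truth value as ~Q => ~P on every row.
contrapositive_agrees = all(
    implies(p, q) == implies(not q, not p) for p, q in rows
)

# P => Q and its converse Q => P differ on at least one row.
converse_agrees = all(implies(p, q) == implies(q, p) for p, q in rows)

print(contrapositive_agrees)  # True
print(converse_agrees)        # False
```

The rows on which the converse disagrees are those where exactly one of P and Q is true.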




Since P ⇒ Q is logically equivalent to ~Q ⇒ ~P, it follows that to prove
P ⇒ Q is true, it suffices to instead prove that ~Q ⇒ ~P is true. If we were
to use direct proof to show ~Q ⇒ ~P is true, we would assume ~Q is true
and use this to deduce that ~P is true. This in fact is the basic approach of
contrapositive proof, summarized as follows.


Outline for Contrapositive Proof


Proposition If P, then Q.


Proof. Suppose ~Q.
⋮
Therefore ~P. ■


So the setup for contrapositive proof is very simple. The first line of the
proof is the sentence “Suppose Q is not true.” (Or something to that effect.)
The last line is the sentence “Therefore P is not true.” Between the first and
last line we use logic and definitions to transform the statement ~Q to the
statement ~P.

To illustrate this new technique, and to contrast it with direct proof, we 
now prove a proposition in two ways: first with direct proof and then with 
contrapositive proof. 


Proposition Suppose x ∈ Z. If 7x + 9 is even, then x is odd.


Proof. (Direct) Suppose 7x + 9 is even.
Thus 7x + 9 = 2a for some integer a.
Subtracting 6x + 9 from both sides, we get x = 2a − 6x − 9.
Thus x = 2a − 6x − 9 = 2a − 6x − 10 + 1 = 2(a − 3x − 5) + 1.
Consequently x = 2b + 1, where b = a − 3x − 5 ∈ Z.
Therefore x is odd. ■


Here is a contrapositive proof of the same statement: 


Proposition Suppose x ∈ Z. If 7x + 9 is even, then x is odd.


Proof. (Contrapositive) Suppose x is not odd.
Thus x is even, so x = 2a for some integer a.
Then 7x + 9 = 7(2a) + 9 = 14a + 8 + 1 = 2(7a + 4) + 1.
Therefore 7x + 9 = 2b + 1, where b is the integer 7a + 4.
Consequently 7x + 9 is odd.
Therefore 7x + 9 is not even. ■




Though the proofs have equal length, you may feel that the contrapositive 
proof was smoother. This is because it is easier to transform information 
about x into information about 7x +9 than the other way around. For our 
next example, consider the following proposition concerning an integer x: 


Proposition Suppose x ∈ Z. If x² − 6x + 5 is even, then x is odd.


A direct proof would be problematic. We would begin by assuming that
x² − 6x + 5 is even, so x² − 6x + 5 = 2a. Then we would need to transform this
into x = 2b + 1 for b ∈ Z. But it is not quite clear how that could be done, for
it would involve isolating an x from the quadratic expression. However, the
proof becomes very simple if we use contrapositive proof.


Proposition Suppose x ∈ Z. If x² − 6x + 5 is even, then x is odd.


Proof. (Contrapositive) Suppose x is not odd.
Thus x is even, so x = 2a for some integer a.
So x² − 6x + 5 = (2a)² − 6(2a) + 5 = 4a² − 12a + 5 = 4a² − 12a + 4 + 1 = 2(2a² − 6a + 2) + 1.
Therefore x² − 6x + 5 = 2b + 1, where b is the integer 2a² − 6a + 2.
Consequently x² − 6x + 5 is odd.
Therefore x² − 6x + 5 is not even. ■


In summary, since x being not odd (~Q) resulted in x² − 6x + 5 being not
even (~P), then x² − 6x + 5 being even (P) means that x is odd (Q). Thus we
have proved P ⇒ Q by proving ~Q ⇒ ~P. Here is another example:


Proposition Suppose x, y ∈ R. If y³ + yx² ≤ x³ + xy², then y ≤ x.


Proof. (Contrapositive) Suppose it is not true that y ≤ x, so y > x.
Then y − x > 0. Multiply both sides of y − x > 0 by the positive value x² + y².


(y − x)(x² + y²) > 0(x² + y²)
yx² + y³ − x³ − xy² > 0
y³ + yx² > x³ + xy²


Therefore y³ + yx² > x³ + xy², so it is not true that y³ + yx² ≤ x³ + xy². ■


Proving “If P, then Q,” with the contrapositive approach necessarily 
involves the negated statements ~ P and ~ Q. In working with these we 
may have to use the techniques for negating statements (e.g., DeMorgan’s 
laws) discussed in Section 2.10. We consider such an example next. 




Proposition Suppose x, y ∈ Z. If 5 ∤ xy, then 5 ∤ x and 5 ∤ y.


Proof. (Contrapositive) Suppose it is not true that 5 ∤ x and 5 ∤ y.
By DeMorgan’s law, it is not true that 5 ∤ x or it is not true that 5 ∤ y.
Therefore 5 | x or 5 | y. We consider these possibilities separately.
Case 1. Suppose 5 | x. Then x = 5a for some a ∈ Z.
From this we get xy = (5a)y = 5(ay), and that means 5 | xy.
Case 2. Suppose 5 | y. Then y = 5a for some a ∈ Z.
From this we get xy = x(5a) = 5(xa), and that means 5 | xy.
The above cases show that 5 | xy, so it is not true that 5 ∤ xy. ■


5.2 Congruence of Integers 


Now is a good time to introduce a new definition that occurs in many 
branches of mathematics and will surely play a role in some of your later 
courses. But our main reason for introducing it is that it provides more 
opportunities to practice writing proofs, both direct and contrapositive. 


Definition 5.1 Given integers a and b and n ∈ N, we say that a and b
are congruent modulo n if n | (a − b). We express this as a ≡ b (mod n).
If a and b are not congruent modulo n, we write this as a ≢ b (mod n).


Example 5.1 Here are some examples:
1. 9 ≡ 1 (mod 4) because 4 | (9 − 1).
2. 6 ≡ 10 (mod 4) because 4 | (6 − 10).
3. 14 ≢ 8 (mod 4) because 4 ∤ (14 − 8).
4. 20 ≡ 4 (mod 8) because 8 | (20 − 4).
5. 17 ≡ −4 (mod 3) because 3 | (17 − (−4)).


In practical terms, a ≡ b (mod n) means that a and b have the same
remainder when divided by n. For example, we saw above that 6 ≡ 10
(mod 4) and indeed 6 and 10 both have remainder 2 when divided by 4. Also
we saw 14 ≢ 8 (mod 4), and sure enough 14 has remainder 2 when divided
by 4, while 8 has remainder 0.

To see that this is true in general, note that if a and b both have the
same remainder r when divided by n, then it follows that a = kn + r and
b = ℓn + r for some k, ℓ ∈ Z. Then a − b = (kn + r) − (ℓn + r) = n(k − ℓ). But
a − b = n(k − ℓ) means n | (a − b), so a ≡ b (mod n). Conversely, this chapter’s
Exercise 32 asks you to show that if a ≡ b (mod n), then a and b have the
same remainder when divided by n.
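
As an informal check of the two descriptions of congruence, here is a small Python sketch. The function names `congruent` and `same_remainder` are our own, chosen for illustration. It tests the five cases of Example 5.1 and then compares the divisibility definition with the same-remainder description on a finite grid of integers. Such a search proves nothing in general; it only shows the two descriptions agreeing on the values tried.

```python
def congruent(a: int, b: int, n: int) -> bool:
    """Definition 5.1: a is congruent to b modulo n when n divides a - b."""
    return (a - b) % n == 0

def same_remainder(a: int, b: int, n: int) -> bool:
    """True when a and b leave the same remainder on division by n.

    For a positive modulus n, Python's % operator returns the remainder r
    with 0 <= r < n, even when a or b is negative.
    """
    return a % n == b % n

# The five cases of Example 5.1, written as (a, b, n).
examples = [(9, 1, 4), (6, 10, 4), (14, 8, 4), (20, 4, 8), (17, -4, 3)]
results = [congruent(a, b, n) for a, b, n in examples]
print(results)  # [True, True, False, True, True]

# Compare the two descriptions on a small grid of integers.
agree = all(
    congruent(a, b, n) == same_remainder(a, b, n)
    for n in range(1, 13)
    for a in range(-30, 31)
    for b in range(-30, 31)
)
print(agree)  # True
```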




We conclude this section with several proofs involving congruence of 
integers, but you will also test your skills with other proofs in the exercises. 


Proposition Let a, b ∈ Z and n ∈ N. If a ≡ b (mod n), then a² ≡ b² (mod n).


Proof. We will use direct proof. Suppose a ≡ b (mod n).
By definition of congruence of integers, this means n | (a − b).
Then by definition of divisibility, there is an integer c for which a − b = nc.
Now multiply both sides of this equation by a + b.


a − b = nc
(a − b)(a + b) = nc(a + b)
a² − b² = nc(a + b)


Since c(a + b) ∈ Z, the above equation tells us n | (a² − b²).
According to Definition 5.1, this gives a² ≡ b² (mod n). ■


Let’s pause to consider this proposition’s meaning. It says a ≡ b (mod n)
implies a² ≡ b² (mod n). In other words, it says that if integers a and b have
the same remainder when divided by n, then a² and b² also have the same
remainder when divided by n. As an example of this, 6 and 10 have the same 
remainder (2) when divided by n = 4, and their squares 36 and 100 also have 
the same remainder (0) when divided by n = 4. The proposition promises 
this will happen for all a, b and n. In our examples we tend to concentrate 
more on how to prove propositions than on what the propositions mean. 
This is reasonable since our main goal is to learn how to prove statements. 
But it is helpful to sometimes also think about the meaning of what we 
prove. 
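
To make the meaning concrete, the following Python sketch reproduces the example with 6 and 10 and then searches a finite range for a counterexample to the proposition. The helper `congruent` and the search bounds are our own assumptions. An empty search result is consistent with the proposition, but only the proof above establishes it for all a, b and n.

```python
def congruent(a: int, b: int, n: int) -> bool:
    """a is congruent to b modulo n when n divides a - b."""
    return (a - b) % n == 0

# The example from the text: 6 and 10 modulo 4, then their squares.
print(6 % 4, 10 % 4)    # 2 2
print(36 % 4, 100 % 4)  # 0 0

# Look for triples with a ≡ b (mod n) but a² ≢ b² (mod n).
counterexamples = [
    (a, b, n)
    for n in range(1, 16)
    for a in range(-20, 21)
    for b in range(-20, 21)
    if congruent(a, b, n) and not congruent(a * a, b * b, n)
]
print(counterexamples)  # []
```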


Proposition Let a, b, c ∈ Z and n ∈ N. If a ≡ b (mod n), then ac ≡ bc (mod n).


Proof. We employ direct proof. Suppose a ≡ b (mod n). By Definition 5.1, it
follows that n | (a − b). Therefore, by definition of divisibility, there exists an
integer k for which a − b = nk. Multiply both sides of this equation by c to
get ac − bc = nkc. Thus ac − bc = n(kc) where kc ∈ Z, which means n | (ac − bc).
By Definition 5.1, we have ac ≡ bc (mod n). ■


Contrapositive proof seems to be the best approach in the next example,
since it will eliminate the symbols ∤ and ≢.




Proposition Suppose a, b ∈ Z and n ∈ N. If 12a ≢ 12b (mod n), then n ∤ 12.


Proof. (Contrapositive) Suppose n | 12. Then 12 = nc for some c ∈ Z. Thus


12(a − b) = nc(a − b).


From this, 12a − 12b = n(ca − cb). Because ca − cb ∈ Z, we get n | (12a − 12b).
This in turn means 12a ≡ 12b (mod n). ■


5.3 Mathematical Writing 


Now that we have begun writing proofs, it is a good time to contemplate the 
craft of writing. Unlike logic and mathematics, where there is a clear-cut 
distinction between what is right or wrong, the difference between good and 
bad writing is sometimes a matter of opinion. But there are some standard 
guidelines that will make your writing clearer. Some are listed below. 


1. Begin each sentence with a word, not a mathematical symbol. 
The reason is that sentences begin with capital letters, but mathematical 
symbols are case sensitive. Because x and X can have entirely different 
meanings, putting such symbols at the beginning of a sentence can lead 
to ambiguity. Here are some examples of bad usage (marked with ×)
and good usage (marked with ✓):


A is a subset of B. ×
The set A is a subset of B. ✓
x is an integer, so 2x + 5 is an integer. ×
Because x is an integer, 2x + 5 is an integer. ✓
x² − x + 2 = 0 has two solutions. ×
X² − x + 2 = 0 has two solutions. × (and silly too)
The equation x² − x + 2 = 0 has two solutions. ✓


2. End each sentence with a period, even when the sentence ends with
a mathematical symbol or expression.


Euler proved that ∑_{k=1}^{∞} 1/k^s = ∏_{p∈P} 1/(1 − 1/p^s) ×
Euler proved that ∑_{k=1}^{∞} 1/k^s = ∏_{p∈P} 1/(1 − 1/p^s). ✓


Mathematical statements (equations, etc.) are like English phrases that
happen to contain special symbols, so use normal punctuation.




3. Separate mathematical symbols and expressions with words. 
Not doing this can cause confusion by making distinct expressions appear 
to merge. Compare the clarity of the following examples. 


Because x² − 1 = 0, x = 1 or x = −1. ×
Because x² − 1 = 0, it follows that x = 1 or x = −1. ✓
Unlike A ∪ B, A ∩ B equals ∅. ×
Unlike A ∪ B, the set A ∩ B equals ∅. ✓


4. Avoid misuse of symbols. Symbols such as =, ≤, ⊆, ∈, etc., are not
words. While it is appropriate to use them in mathematical expressions,
they are out of place in other contexts.


Since the two sets are =, one is a subset of the other. ×
Since the two sets are equal, one is a subset of the other. ✓
The empty set is a ⊆ of every set. ×
The empty set is a subset of every set. ✓
Since a is odd and x odd ⇒ x² odd, a² is odd. ×
Since a is odd and any odd number squared is odd, a² is odd. ✓


5. Avoid using unnecessary symbols. Mathematics is confusing enough 
without them. Don’t muddy the water even more. 


No set X has negative cardinality. ×
No set has negative cardinality. ✓


6. Use the first person plural. In mathematical writing, it is common 
to use the words “we” and “us” rather than “I,” “you” or “me.” It is as if 
the reader and writer are having a conversation, with the writer guiding 
the reader through the details of the proof. 


7. Use the active voice. This is just a suggestion, but the active voice 
makes your writing more lively. (And briefer too.) 


The value x = 3 is obtained through division of both sides by 5. ×
Dividing both sides by 5, we get x = 3. ✓


8. Explain each new symbol. In writing a proof, you must explain the 
meaning of every new symbol you introduce. Failure to do this can lead 
to ambiguity, misunderstanding and mistakes. For example, consider 
the following two possibilities for a sentence in a proof, where a and b 
have been introduced on a previous line. 




Since a | b, it follows that b = ac. ×
Since a | b, it follows that b = ac for some integer c. ✓


If you use the first form, then the reader may momentarily scan backwards
looking for where the c entered into the picture, not realizing at
first that it came from the definition of divides.


9. Watch out for “it.” The pronoun “it” causes confusion when it is 
unclear what it refers to. If there is any possibility of confusion, you 
should avoid “it.” Here is an example: 


Since X ⊆ Y, and 0 < |X|, we see that it is not empty. ×
Is “it” X or Y? Either one would make sense, but which do we mean?
Since X ⊆ Y, and 0 < |X|, we see that Y is not empty. ✓


10. Since, because, as, for, so. In proofs, it is common to use these 
words as conjunctions joining two statements, and meaning that one 
statement is true and as a consequence the other true. The following 
statements all mean that P is true (or assumed to be true) and as a 
consequence Q is true also. 


Q since P      Q because P      Q, as P      Q, for P      P, so Q
Since P, Q     Because P, Q     As P, Q


Notice that the meaning of these constructions is different from that of 
“If P, then Q,” for they are asserting not only that P implies Q, but also 
that P is true. Exercise care in using them. It must be the case that P 
and Q are both statements and that Q really does follow from P. 


x ∈ N, so Z ×
x ∈ N, so x ∈ Z ✓


11. Thus, hence, therefore, consequently. These adverbs precede a 
statement that follows logically from previous sentences or clauses. Be 
sure that a statement follows them. 


Therefore 2k + 1. ×
Therefore a = 2k + 1. ✓


12. Clarity is the gold standard of mathematical writing. If you 
think breaking a rule makes your writing clearer, then break the rule. 


Your mathematical writing will evolve with practice. One of the best 
ways to develop a good mathematical writing style is to read other people’s 
proofs. Adopt what works and avoid what doesn’t. 




Exercises for Chapter 5 


A. Prove the following statements with contrapositive proof. (In each case, think 
about how a direct proof would work. In most cases contrapositive is easier.) 

1. Suppose n ∈ Z. If n² is even, then n is even.
2. Suppose n ∈ Z. If n² is odd, then n is odd.
3. Suppose a, b ∈ Z. If a²(b² − 2b) is odd, then a and b are odd.
4. Suppose a, b, c ∈ Z. If a does not divide bc, then a does not divide b.
5. Suppose x ∈ R. If x² + 5x < 0 then x < 0.
6. Suppose x ∈ R. If x³ − x > 0 then x > −1.
7. Suppose a, b ∈ Z. If both ab and a + b are even, then both a and b are even.
8. Suppose x ∈ R. If x⁵ − 4x⁴ + 3x³ − x² + 3x − 4 ≥ 0, then x ≥ 0.
9. Suppose n ∈ Z. If 3 ∤ n², then 3 ∤ n.
10. Suppose x, y, z ∈ Z and x ≠ 0. If x ∤ yz, then x ∤ y and x ∤ z.
11. Suppose x, y ∈ Z. If x²(y + 3) is even, then x is even or y is odd.
12. Suppose a ∈ Z. If a² is not divisible by 4, then a is odd.
13. Suppose x ∈ R. If x⁵ + 7x³ + 5x ≥ x⁴ + x² + 8, then x ≥ 0.


B. Prove the following statements using either direct or contrapositive proof. 


14. If a, b ∈ Z and a and b have the same parity, then 3a + 7 and 7b − 4 do not.
15. Suppose x ∈ Z. If x³ − 1 is even, then x is odd.
16. Suppose x, y ∈ Z. If x + y is even, then x and y have the same parity.
17. If n is odd, then 8 | (n² − 1).
18. If a, b ∈ Z, then (a + b)³ ≡ a³ + b³ (mod 3).
19. Let a, b, c ∈ Z and n ∈ N. If a ≡ b (mod n) and a ≡ c (mod n), then c ≡ b (mod n).
20. If a ∈ Z and a ≡ 1 (mod 5), then a² ≡ 1 (mod 5).
21. Let a, b ∈ Z and n ∈ N. If a ≡ b (mod n), then a³ ≡ b³ (mod n).
22. Let a ∈ Z, n ∈ N. If a has remainder r when divided by n, then a ≡ r (mod n).
23. Let a, b ∈ Z and n ∈ N. If a ≡ b (mod n), then a² ≡ ab (mod n).
24. If a ≡ b (mod n) and c ≡ d (mod n), then ac ≡ bd (mod n).
25. Let n ∈ N. If 2ⁿ − 1 is prime, then n is prime.
26. If n = 2ᵏ − 1 for k ∈ N, then every entry in Row n of Pascal’s Triangle is odd.
27. If a ≡ 0 (mod 4) or a ≡ 1 (mod 4), then (a choose 2) is even.
28. If n ∈ Z, then 4 ∤ (n² − 3).
29. If integers a and b are not both zero, then gcd(a, b) = gcd(a − b, b).
30. If a ≡ b (mod n), then gcd(a, n) = gcd(b, n).
31. Suppose the division algorithm applied to a and b yields a = qb + r. Prove
gcd(a, b) = gcd(r, b).
32. If a ≡ b (mod n), then a and b have the same remainder when divided by n.




CHAPTER 6 


Proof by Contradiction 


We now explore a third method of proof: proof by contradiction. This
method is not limited to proving just conditional statements; it can
be used to prove any kind of statement whatsoever. The basic idea is to
assume that the statement we want to prove is false, and then show that
this assumption leads to nonsense. We are then led to conclude that we
were wrong to assume the statement was false, so the statement must be
true. As an example, consider the following proposition and its proof.


Proposition If a, b ∈ Z, then a² − 4b ≠ 2.


Proof. Suppose this proposition is false.
This conditional statement being false means there exist numbers a and b
for which a, b ∈ Z is true, but a² − 4b ≠ 2 is false.
In other words, there exist integers a, b ∈ Z for which [ a² − 4b = 2 ].
From this equation we get a² = 4b + 2 = 2(2b + 1), so a² is even.
Because a² is even, it follows that a is even, so a = 2c for some integer c.
Now plug a = 2c back into the boxed equation to get (2c)² − 4b = 2,
so 4c² − 4b = 2. Dividing by 2, we get 2c² − 2b = 1.
Therefore 1 = 2(c² − b), and because c² − b ∈ Z, it follows that 1 is even.
We know 1 is not even, so something went wrong.
But all the logic after the first line of the proof is correct, so it must be that
the first line was incorrect. In other words, we were wrong to assume the
proposition was false. Thus the proposition is true. ■
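
A brute-force search cannot replace this proof, since it covers only finitely many pairs, but it gives a quick sanity check. The Python sketch below (the search bounds are arbitrary choices of ours) looks for integers a and b with a² − 4b = 2 and also tabulates the remainders that squares leave on division by 4, a related observation that is not part of the proof above.

```python
# Search a finite grid for integer solutions of a^2 - 4b = 2.
solutions = [
    (a, b)
    for a in range(-200, 201)
    for b in range(-200, 201)
    if a * a - 4 * b == 2
]
print(solutions)  # []

# Squares of the sampled integers leave remainder 0 or 1 on division by 4,
# so a^2 - 4b never leaves remainder 2 for these values.
remainders = sorted({(a * a) % 4 for a in range(-200, 201)})
print(remainders)  # [0, 1]
```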


You may be a bit suspicious of this line of reasoning, but in the next
section we will see that it is logically sound. For now, notice that at the end of
the proof we deduced that 1 is even, which conflicts with our knowledge that
1 is odd. In essence, we have obtained the statement (1 is odd) ∧ ~(1 is odd),
which has the form C ∧ ~C. Notice that no matter what statement C is, and
whether or not it is true, the statement C ∧ ~C is false. A statement, like
this one, that cannot be true is called a contradiction. Contradictions
play a key role in our new technique.




6.1 Proving Statements with Contradiction 


Let’s now see why the proof on the previous page is logically valid. In that
proof we needed to show that a statement P : (a, b ∈ Z) ⇒ (a² − 4b ≠ 2) was true.
The proof began with the assumption that P was false, that is that ~P was
true, and from this we deduced C ∧ ~C. In other words we proved that ~P
being true forces C ∧ ~C to be true, and this means that we proved that the
conditional statement (~P) ⇒ (C ∧ ~C) is true. To see that this is the same
as proving P is true, look at the following truth table for (~P) ⇒ (C ∧ ~C).
Notice that the columns for P and (~P) ⇒ (C ∧ ~C) are exactly the same, so
P is logically equivalent to (~P) ⇒ (C ∧ ~C).


P | C | ~P | C ∧ ~C | (~P) ⇒ (C ∧ ~C)
T | T | F  |   F    |        T
T | F | F  |   F    |        T
F | T | T  |   F    |        F
F | F | T  |   F    |        F
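
The table can be regenerated mechanically. In this Python sketch the helper `implies` is our own name for the conditional. It builds each row and confirms that the column for (~P) ⇒ (C ∧ ~C) matches the column for P.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    """Truth value of the conditional statement p => q."""
    return (not p) or q

table = []
for p, c in product([True, False], repeat=2):
    contradiction = c and (not c)                # C ∧ ~C is False in every row
    conditional = implies(not p, contradiction)  # (~P) => (C ∧ ~C)
    table.append((p, c, conditional))

# The last column equals the P column in every row.
same_column = all(p == conditional for p, _, conditional in table)
print(same_column)  # True
```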


Therefore to prove a statement P, it suffices to instead prove the conditional
statement (~P) ⇒ (C ∧ ~C). This can be done with direct proof: Assume
~P and deduce C ∧ ~C. Here is the outline:


Outline for Proof by Contradiction 


Proposition P. 


Proof. Suppose ~P.
⋮
Therefore C ∧ ~C. ■


A slightly unsettling feature of this method is that we may not know at 
the beginning of the proof what the statement C is going to be. In doing the 
scratch work for the proof, you assume that ~ P is true, then deduce new 
statements until you have deduced some statement C and its negation ~ C. 

If this method seems confusing, look at it this way. In the first line of 
the proof we suppose ~ P is true, that is, we assume P is false. But if P is 
really true then this contradicts our assumption that P is false. But we 
haven’t yet proved P to be true, so the contradiction is not obvious. We use 
logic and reasoning to transform the non-obvious contradiction ~P to an
obvious contradiction C ∧ ~C.




The idea of proof by contradiction is ancient, going back at least to the 
Pythagoreans, who used it to prove that certain numbers are irrational. 
Our next example follows their logic to prove that √2 is irrational. Recall
that a number is rational if it is a fraction of integers, and it is irrational if
it cannot be expressed as a fraction of integers. Here is the exact definition:


Definition 6.1 A real number x is rational if x = a/b for some a, b ∈ Z.
Also, x is irrational if it is not rational, that is if x ≠ a/b for every a, b ∈ Z.


We are now ready to use contradiction to prove that √2 is irrational.
According to the outline, the first line of the proof should be “Suppose that
it is not true that √2 is irrational.” But it is helpful (though not mandatory)
to tip our reader off to the fact that we are using proof by contradiction.
One standard way of doing this is to make the first line “Suppose for the
sake of contradiction that it is not true that √2 is irrational.”


Proposition The number √2 is irrational.


Proof. Suppose for the sake of contradiction that it is not true that √2 is
irrational. Then √2 is rational, so there are integers a and b for which


√2 = a/b. (6.1)


Let this fraction be fully reduced; in particular, this means that a and
b are not both even. (If they were both even, then the fraction could be
further reduced by factoring 2’s from the numerator and denominator and
canceling.) Squaring both sides of Equation (6.1) gives 2 = a²/b², and therefore


a² = 2b². (6.2)


From this it follows that a² is even. But we proved earlier (Exercise 1 of
Chapter 5) that a² being even implies a is even. Thus, as we know that a
and b are not both even, it follows that b is odd. Now, since a is even there
is an integer c for which a = 2c. Plugging this value for a into Equation (6.2),
we get (2c)² = 2b², so 4c² = 2b², and hence b² = 2c². This means b² is even,
so b is even also. But previously we deduced that b is odd. Thus we have
the contradiction b is even and b is odd. ■


To appreciate the power of proof by contradiction, imagine trying to 
prove that √2 is irrational without it. Where would we begin? What would
be our initial assumption? There are no clear answers to these questions. 




Proof by contradiction gives us a starting point: Assume √2 is rational, and
work from there.

In the above proof we got the contradiction (b is even) ∧ ~(b is even) which
has the form C ∧ ~C. In general, your contradiction need not necessarily be
of this form. Any statement that is clearly false is sufficient. For example
2 ≠ 2 would be a fine contradiction, as would be 4 | 2, provided that you could
deduce them.

Here is another ancient example, dating back at least as far as Euclid: 


Proposition There are infinitely many prime numbers. 


Proof. For the sake of contradiction, suppose there are only finitely many
prime numbers. Then we can list all the prime numbers as p₁, p₂, p₃, ..., pₙ,
where p₁ = 2, p₂ = 3, p₃ = 5, p₄ = 7 and so on. Thus pₙ is the nth and largest
prime number. Now consider the number a = (p₁p₂p₃···pₙ) + 1, that is, a is
the product of all prime numbers, plus 1. Now a, like any natural number
greater than 1, has at least one prime divisor, and that means pₖ | a for at
least one of our n prime numbers pₖ. Thus there is an integer c for which
a = cpₖ, which is to say


(p₁p₂p₃···pₖ₋₁pₖpₖ₊₁···pₙ) + 1 = cpₖ.


Dividing both sides of this by pₖ gives us


(p₁p₂p₃···pₖ₋₁pₖ₊₁···pₙ) + 1/pₖ = c,


so


1/pₖ = c − (p₁p₂p₃···pₖ₋₁pₖ₊₁···pₙ).


The expression on the right is an integer, while the expression on the left is
not an integer. This is a contradiction. ■
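
The construction in this proof is easy to try on a short list of primes. The Python sketch below is only an illustration. The function names are ours, and the list [2, 3, 5, 7, 11, 13] is of course not a list of all primes. It shows that the number a leaves remainder 1 when divided by each listed prime, so any prime divisor of a lies outside the list. Note that a itself need not be prime.

```python
from math import prod

def euclid_number(primes):
    """Product of the given primes, plus 1."""
    return prod(primes) + 1

def smallest_prime_factor(m):
    """Smallest prime divisor of an integer m > 1, found by trial division."""
    d = 2
    while d * d <= m:
        if m % d == 0:
            return d
        d += 1
    return m

primes = [2, 3, 5, 7, 11, 13]
a = euclid_number(primes)
print(a)                         # 30031
print([a % p for p in primes])   # [1, 1, 1, 1, 1, 1]
print(smallest_prime_factor(a))  # 59
```

Here 30031 = 59 · 509 is composite, yet its prime divisors 59 and 509 are both missing from the original list.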


Proof by contradiction often works well in proving statements of the form 
∀x, P(x). The reason is that the proof set-up involves assuming ~∀x, P(x),
which as we know from Section 2.10 is equivalent to ∃x, ~P(x). This gives
us a specific x for which ~ P(x) is true, and often that is enough to produce 
a contradiction. Here is an example: 


Proposition For every real number x ∈ [0, π/2], we have sin x + cos x ≥ 1.


Proof. Suppose for the sake of contradiction that this is not true.
Then there exists an x ∈ [0, π/2] for which sin x + cos x < 1.




Since x ∈ [0, π/2], neither sin x nor cos x is negative, so 0 ≤ sin x + cos x < 1.
Thus 0² ≤ (sin x + cos x)² < 1², which gives 0² ≤ sin²x + 2 sin x cos x + cos²x < 1².
As sin²x + cos²x = 1, this becomes 0 ≤ 1 + 2 sin x cos x < 1, so 1 + 2 sin x cos x < 1.
Subtracting 1 from both sides gives 2 sin x cos x < 0.
But this contradicts the fact that neither sin x nor cos x is negative. ■
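
A numerical spot check of this proposition is sketched below in Python. The sample size and the rounding tolerance are arbitrary choices of ours. Sampling finitely many points with floating-point arithmetic is no substitute for the proof; it merely fails to find a counterexample.

```python
import math

# Sample 1001 evenly spaced points of the interval [0, pi/2].
samples = [i * (math.pi / 2) / 1000 for i in range(1001)]
smallest = min(math.sin(x) + math.cos(x) for x in samples)

# Allow a tiny tolerance for floating-point rounding near the endpoints.
print(smallest >= 1 - 1e-12)  # True
```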


6.2 Proving Conditional Statements by Contradiction 


Since the previous two chapters dealt exclusively with proving conditional 
statements, we now formalize the procedure in which contradiction is used 
to prove a conditional statement. Suppose we want to prove a proposition 
of the following form. 


Proposition If P, then Q.


Thus we need to prove that P ⇒ Q is true. Proof by contradiction begins
with the assumption that ~(P ⇒ Q) is true, that is, that P ⇒ Q is false. But
we know that P ⇒ Q being false means that it is possible that P can be true
while Q is false. Thus the first step in the proof is to assume P and ~Q.
Here is an outline:


Outline for Proving a Conditional 
Statement with Contradiction 


Proposition If P, then Q. 


Proof. Suppose P and ~Q.
⋮
Therefore C ∧ ~C. ■


To illustrate this new technique, we revisit a familiar result: If a² is
even, then a is even. According to the outline, the first line of the proof
should be “For the sake of contradiction, suppose a² is even and a is not
even.”


Proposition Suppose a ∈ Z. If a² is even, then a is even.


Proof. For the sake of contradiction, suppose a² is even and a is not even.
Then a² is even, and a is odd.
Since a is odd, there is an integer c for which a = 2c + 1.
Then a² = (2c + 1)² = 4c² + 4c + 1 = 2(2c² + 2c) + 1, so a² is odd.
Thus a² is even and a² is not even, a contradiction. ■




Here is another example. 


Proposition If a, b ∈ Z and a ≥ 2, then a ∤ b or a ∤ (b + 1).


Proof. Suppose for the sake of contradiction there exist a, b ∈ Z with a ≥ 2,
and for which it is not true that a ∤ b or a ∤ (b + 1).
By DeMorgan’s law, we have a | b and a | (b + 1).
The definition of divisibility says there are c, d ∈ Z with b = ac and b + 1 = ad.
Subtracting one equation from the other gives ad − ac = 1, so a(d − c) = 1.
Since a is positive, d − c is also positive (otherwise a(d − c) would be negative).
Then d − c is a positive integer and a(d − c) = 1, so a = 1/(d − c) < 2.
Thus we have a ≥ 2 and a < 2, a contradiction. ■
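
The assumption made at the start of this proof, namely that some a ≥ 2 divides both b and b + 1, can be hunted for directly. This Python sketch (with search bounds chosen arbitrarily by us) finds no such pair in the range tried, which is what the proposition predicts.

```python
# Look for a >= 2 that divides both b and b + 1.
violations = [
    (a, b)
    for a in range(2, 51)
    for b in range(-100, 101)
    if b % a == 0 and (b + 1) % a == 0
]
print(violations)  # []
```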


6.3 Combining Techniques 


Often in more complex proofs several proof techniques are combined within 
a single proof. For example, in proving a conditional statement P ⇒ Q, we
might begin with direct proof and thus assume P to be true with the aim of 
ultimately showing Q is true. But the truth of Q might hinge on the truth 
of some other statement R which—together with P—would imply Q. We 
would then need to prove R, and we would use whichever proof technique 
seems most appropriate. This can lead to “proofs inside of proofs.” Consider 
the following example. The overall approach is direct, but inside the direct 
proof is a separate proof by contradiction. 


Proposition Every non-zero rational number can be expressed as a product
of two irrational numbers.


Proof. This proposition can be reworded as follows: If r is a non-zero rational
number, then r is a product of two irrational numbers. In what follows, we
prove this with direct proof.

Suppose r is a non-zero rational number. Then r = a/b for integers a and b.
Also, r can be written as a product of two numbers as follows:


r = √2 · (r/√2).


We know √2 is irrational, so to complete the proof we must show r/√2 is also
irrational.
To show this, assume for the sake of contradiction that r/√2 is rational.
This means


r/√2 = c/d




for integers c and d, so


√2 = r · (d/c).


But we know r = a/b, which combines with the above equation to give


√2 = r · (d/c) = (a/b)(d/c) = ad/bc.


This means √2 is rational, which is a contradiction because we know it is
irrational. Therefore r/√2 is irrational.


Hence r = √2 · (r/√2) is a product of two irrational numbers. ■


For another example of a proof-within-a-proof, try Exercise 5 at the end 
of this chapter (or see its solution). Exercise 5 asks you to prove that √3
is irrational. This turns out to be slightly trickier than proving that √2 is
irrational. 


6.4 Some Words of Advice 


Despite the power of proof by contradiction, it’s best to use it only when the 
direct and contrapositive approaches do not seem to work. The reason for 
this is that a proof by contradiction can often have hidden in it a simpler 
contrapositive proof, and if this is the case it’s better to go with the simpler 
approach. Consider the following example. 


Proposition Suppose a ∈ Z. If a² − 2a + 7 is even, then a is odd.


Proof. (Contradiction) Suppose a² − 2a + 7 is even and a is not odd.
That is, suppose a² − 2a + 7 is even and a is even.
Since a is even, there is an integer c for which a = 2c.
Then a² − 2a + 7 = (2c)² − 2(2c) + 7 = 2(2c² − 2c + 3) + 1, so a² − 2a + 7 is odd.
Thus a² − 2a + 7 is both even and odd, a contradiction. ■


Though there is nothing really wrong with this proof, notice that part
of it assumes a is not odd and deduces that a² − 2a + 7 is not even. That is
the contrapositive approach! Thus it would be more efficient to proceed as
follows, using contrapositive proof.


Proposition Suppose a ∈ Z. If a² − 2a + 7 is even, then a is odd.


Proof. (Contrapositive) Suppose a is not odd.
Then a is even, so there is an integer c for which a = 2c.
Then a² − 2a + 7 = (2c)² − 2(2c) + 7 = 2(2c² − 2c + 3) + 1, so a² − 2a + 7 is odd.
Thus a² − 2a + 7 is not even. ■
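
Both formulations of this proposition can be tested side by side on a finite range of integers. In the Python sketch below, the helper `is_even` and the range are our own illustrative choices. The check agrees with the proofs above but covers only the values tried.

```python
def is_even(m: int) -> bool:
    """True when the integer m is divisible by 2."""
    return m % 2 == 0

values = range(-100, 101)

# Contrapositive form: if a is even, then a^2 - 2a + 7 is odd.
contrapositive_holds = all(
    not is_even(a * a - 2 * a + 7) for a in values if is_even(a)
)

# Original form: if a^2 - 2a + 7 is even, then a is odd.
original_holds = all(
    not is_even(a) for a in values if is_even(a * a - 2 * a + 7)
)

print(contrapositive_holds, original_holds)  # True True
```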




Exercises for Chapter 6 


A. Use the method of proof by contradiction to prove the following statements. (In 
each case, you should also think about how a direct or contrapositive proof would 
work. You will find in most cases that proof by contradiction is easier.) 

1. Suppose n ∈ Z. If n is odd, then n² is odd.
2. Suppose n ∈ Z. If n² is odd, then n is odd.
3. Prove that √2 is irrational.
4. Prove that √6 is irrational.
5. Prove that √3 is irrational.
6. If a, b ∈ Z, then a² − 4b − 2 ≠ 0.
7. If a, b ∈ Z, then a² − 4b − 3 ≠ 0.
8. Suppose a, b, c ∈ Z. If a² + b² = c², then a or b is even.
9. Suppose a, b ∈ R. If a is rational and ab is irrational, then b is irrational.
10. There exist no integers a and b for which 21a + 30b = 1.
11. There exist no integers a and b for which 18a + 6b = 1.
12. For every positive x ∈ Q, there is a positive y ∈ Q for which y < x.
13. For every x ∈ [π/2, π], sin x − cos x ≥ 1.
14. If A and B are sets, then A ∩ (B − A) = ∅.
15. If b ∈ Z and b ∤ k for every k ∈ N, then b = 0.
16. If a and b are positive real numbers, then a + b ≥ 2√(ab).
17. For every n ∈ Z, 4 ∤ (n² + 2).
18. Suppose a, b ∈ Z. If 4 | (a² + b²), then a and b are not both odd.


B. Prove the following statements using any method from Chapters 4, 5 or 6.

19. The product of any five consecutive integers is divisible by 120. (For example,
the product of 3, 4, 5, 6 and 7 is 2520, and 2520 = 120 · 21.)

20. We say that a point P = (x, y) in R² is rational if both x and y are rational.
More precisely, P is rational if P = (x, y) ∈ Q². An equation F(x, y) = 0 is said
to have a rational point if there exists x₀, y₀ ∈ Q such that F(x₀, y₀) = 0. For
example, the curve x² + y² − 1 = 0 has rational point (x₀, y₀) = (1, 0). Show that
the curve x² + y² − 3 = 0 has no rational points.

21. Exercise 20 (above) involved showing that there are no rational points on the
curve x² + y² − 3 = 0. Use this fact to show that √3 is irrational.

22. Explain why x² + y² − 3 = 0 not having any rational solutions (Exercise 20)
implies x² + y² − 3ᵏ = 0 has no rational solutions for k an odd, positive integer.

23. Use the above result to prove that √(3ᵏ) is irrational for all odd, positive k.

24. The number log₂ 3 is irrational.




Part III


More on Proof 


CHAPTER 7 


Proving Non-Conditional Statements 


The last three chapters introduced three major proof techniques: direct,
contrapositive and contradiction. These three techniques are used to
prove statements of the form “If P, then Q.” As we know, most theorems 
and propositions have this conditional form, or they can be reworded to 
have this form. Thus the three main techniques are quite important. But 
some theorems and propositions cannot be put into conditional form. For 
example, some theorems have form “P if and only if Q.” Such theorems 
are biconditional statements, not conditional statements. In this chapter 
we examine ways to prove them. In addition to learning how to prove 
if-and-only-if theorems, we will also look at two other types of theorems. 


7.1 If-and-Only-If Proof 


Some propositions have the form 
P if and only if Q. 


We know from Section 2.4 that this statement asserts that both of the 
following conditional statements are true: 


If P, then Q. 
If Q, then P. 


So to prove “P if and only if Q,” we must prove two conditional statements. 
Recall from Section 2.4 that Q ⇒ P is called the converse of P ⇒ Q. Thus
we need to prove both P ⇒ Q and its converse. These are both conditional
statements, so we may prove them with direct, contrapositive or
contradiction proof. Here is an outline:


Outline for If-and-Only-If Proof 
Proposition P if and only if Q. 


Proof. 
[Prove P ⇒ Q using direct, contrapositive or contradiction proof.]
[Prove Q ⇒ P using direct, contrapositive or contradiction proof.] ∎




Let’s start with a very simple example. You already know that an integer 
n is odd if and only if n^2 is odd, but let’s prove it anyway, just to illustrate
the outline. In this example we prove (n is odd) ⇒ (n^2 is odd) using direct
proof and (n^2 is odd) ⇒ (n is odd) using contrapositive proof.


Proposition The integer n is odd if and only if n^2 is odd.


Proof. First we show that n being odd implies that n^2 is odd. Suppose n
is odd. Then, by definition of an odd number, n = 2a + 1 for some integer a.
Thus n^2 = (2a + 1)^2 = 4a^2 + 4a + 1 = 2(2a^2 + 2a) + 1. This expresses n^2 as twice
an integer, plus 1, so n^2 is odd.

Conversely, we need to prove that n^2 being odd implies that n is odd. We
use contrapositive proof. Suppose n is not odd. Then n is even, so n = 2a for
some integer a (by definition of an even number). Thus n^2 = (2a)^2 = 2(2a^2),
so n^2 is even because it’s twice an integer. Thus n^2 is not odd. We’ve now
proved that if n is not odd, then n^2 is not odd, and this is a contrapositive
proof that if n^2 is odd then n is odd. ∎
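
As a sanity check only (it is no substitute for the proof), the biconditional can be tested on a finite range of integers. The sketch below is an illustration, not part of the text; the helper names is_odd and biconditional_holds and the range of test values are assumptions chosen for the example.

```python
def is_odd(k: int) -> bool:
    """True when k = 2a + 1 for some integer a (Python's % gives 0 or 1 here)."""
    return k % 2 == 1


def biconditional_holds(n: int) -> bool:
    """Check that 'n is odd' and 'n^2 is odd' have the same truth value."""
    return is_odd(n) == is_odd(n * n)


# A finite search for counterexamples; an empty list is evidence, not a proof.
counterexamples = [n for n in range(-1000, 1001) if not biconditional_holds(n)]
print(counterexamples)
```

Comparing the two truth values with == mirrors the meaning of "if and only if": the two statements are either both true or both false.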


In proving “P if and only if Q,” you should begin a new paragraph when 
starting the proof of Q ⇒ P. Since this is the converse of P ⇒ Q, it’s a good
idea to begin the paragraph with the word “Conversely” (as we did above) to 
remind the reader that you've finished the first part of the proof and are 
moving on to the second. Likewise, it’s a good idea to remind the reader of 
exactly what statement that paragraph is proving. 

The next example uses direct proof in both parts of the proof. 


Proposition Suppose a and b are integers. Then a ≡ b (mod 6) if and only
if a ≡ b (mod 2) and a ≡ b (mod 3).


Proof. First we prove that if a ≡ b (mod 6), then a ≡ b (mod 2) and a ≡ b
(mod 3). Suppose a ≡ b (mod 6). This means 6 | (a - b), so there is an integer
n for which

a - b = 6n.


From this we get a - b = 2(3n), which implies 2 | (a - b), so a ≡ b (mod 2). But
we also get a - b = 3(2n), which implies 3 | (a - b), so a ≡ b (mod 3). Therefore
a ≡ b (mod 2) and a ≡ b (mod 3).

Conversely, suppose a ≡ b (mod 2) and a ≡ b (mod 3). Since a ≡ b (mod 2)
we get 2 | (a - b), so there is an integer k for which a - b = 2k. Therefore a - b
is even. Also, from a ≡ b (mod 3) we get 3 | (a - b), so there is an integer ℓ for
which

a - b = 3ℓ.




But since we know a - b is even, it follows that ℓ must be even also, for
if it were odd then a - b = 3ℓ would be odd (because a - b would be the
product of two odd integers). Hence ℓ = 2m for some integer m. Thus
a - b = 3ℓ = 3 · 2m = 6m. This means 6 | (a - b), so a ≡ b (mod 6). ∎
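
The proposition can also be spot-checked numerically. The following is a minimal sketch, assuming a hypothetical helper congruent(a, b, m) that encodes the definition "m divides a - b"; the window of test values is arbitrary, and a clean run illustrates the statement rather than proving it.

```python
def congruent(a: int, b: int, m: int) -> bool:
    """Return True when a ≡ b (mod m), that is, when m divides a - b."""
    return (a - b) % m == 0


# Collect every pair where the two sides of the biconditional disagree.
failures = [
    (a, b)
    for a in range(-30, 31)
    for b in range(-30, 31)
    if congruent(a, b, 6) != (congruent(a, b, 2) and congruent(a, b, 3))
]
print(failures)
```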


Since if-and-only-if proofs simply combine methods with which we are 
already familiar, we will not do any further examples in this section. But it 
is of utmost importance that you practice your skill on some of this chapter’s 
exercises. 


7.2 Equivalent Statements 


In other courses you will sometimes encounter a certain kind of theorem 
that is neither a conditional nor a biconditional statement. Instead, it 
asserts that a list of statements is “equivalent.” You saw this (or will see it) 
in your linear algebra textbook, which featured the following theorem: 


Theorem Suppose A is an n × n matrix. The following statements are
equivalent:

(a) The matrix A is invertible.

(b) The equation Ax = b has a unique solution for every b ∈ ℝ^n.

(c) The equation Ax = 0 has only the trivial solution.

(d) The reduced row echelon form of A is I_n.

(e) det(A) ≠ 0.

(f) The matrix A does not have 0 as an eigenvalue. 


When a theorem asserts that a list of statements is “equivalent,” it is 
asserting that either the statements are all true, or they are all false. Thus 
the above theorem tells us that whenever we are dealing with a particular 
n × n matrix A, then either the statements (a) through (f) are all true for A,
or statements (a) through (f) are all false for A. For example, if we happen to
know that det(A) ≠ 0, the theorem assures us that in addition to statement
(e) being true, all the statements (a) through (f) are true. On the other 
hand, if it happens that det(A) = 0, the theorem tells us that all statements 
(a) through (f) are false. In this way, the theorem multiplies our knowledge 
of A by a factor of six. Obviously that can be very useful. 

What method would we use to prove such a theorem? In a certain 
sense, the above theorem is like an if-and-only-if theorem. An if-and-only-if 
theorem of form P ⇔ Q asserts that P and Q are either both true or both
false, that is, that P and Q are equivalent. To prove P ⇔ Q we prove P ⇒ Q
followed by Q ⇒ P, essentially making a “cycle” of implications from P to Q




and back to P. Similarly, one approach to proving the theorem about the 
n × n matrix would be to prove the conditional statement (a) ⇒ (b), then
(b) ⇒ (c), then (c) ⇒ (d), then (d) ⇒ (e), then (e) ⇒ (f) and finally (f) ⇒ (a).
This pattern is illustrated below.


(a) ⇒ (b) ⇒ (c)
 ⇑           ⇓
(f) ⇐ (e) ⇐ (d)


Notice that if these six implications have been proved, then it really does 
follow that the statements (a) through (f) are either all true or all false. 
If one of them is true, then the circular chain of implications forces them 
all to be true. On the other hand, if one of them (say (c)) is false, the fact 
that (b) ⇒ (c) is true forces (b) to be false. This combined with the truth of
(a) ⇒ (b) makes (a) false, and so on counterclockwise around the circle.

Thus to prove that n statements are equivalent, it suffices to prove 
n conditional statements showing each statement implies another, in a 
circular pattern. But it is not necessary that the pattern be circular. The 
following schemes would also do the job: 


(a) ⇒ (b) ⇔ (c)
 ⇑     ⇓
(f) ⇐ (e) ⇔ (d)


(a) ⇔ (b) ⇔ (c)
       ⇕
(f) ⇔ (e) ⇔ (d)


But a circular pattern yields the fewest conditional statements that 
must be proved. Whatever the pattern, each conditional statement can be 
proved with either direct, contrapositive or contradiction proof. 
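
One way to decide whether a proposed scheme of implications is enough: treat each statement as a node and each proved implication as a directed edge, then check that every statement can be reached from every other. The sketch below is an illustration under that assumption; the function names reachable and proves_equivalence and the sample schemes are hypothetical, not from the text.

```python
from typing import Dict, List, Set, Tuple


def reachable(graph: Dict[str, List[str]], start: str) -> Set[str]:
    """Statements that follow from `start` by chaining proved implications."""
    seen = {start}
    stack = [start]
    while stack:
        node = stack.pop()
        for nxt in graph[node]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def proves_equivalence(statements: List[str],
                       implications: List[Tuple[str, str]]) -> bool:
    """True when every statement implies every other through the given edges."""
    graph: Dict[str, List[str]] = {s: [] for s in statements}
    for p, q in implications:
        graph[p].append(q)
    return all(reachable(graph, s) == set(statements) for s in statements)


statements = ["a", "b", "c", "d", "e", "f"]
cycle = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e"), ("e", "f"), ("f", "a")]
broken = cycle[:-1]  # the same chain with (f) => (a) left unproved

print(proves_equivalence(statements, cycle))
print(proves_equivalence(statements, broken))
```

The full cycle passes the check with six implications, while dropping the last implication leaves (a) unreachable from the other statements, so the equivalence is no longer established.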

Though we shall not do any of these proofs in this text, you are sure to 
encounter them in subsequent courses. 


7.3 Existence Proofs; Existence and Uniqueness Proofs 


Up until this point, we have dealt with proving conditional statements 
or with statements that can be expressed with two or more conditional 
statements. Generally, these conditional statements have form P(x) ⇒ Q(x).
(Possibly with more than one variable.) We saw in Section 2.8 that this can
be interpreted as a universally quantified statement ∀x, P(x) ⇒ Q(x).




Thus, conditional statements are universally quantified statements,
so in proving a conditional statement (whether we use direct, contrapositive
or contradiction proof) we are really proving a universally quantified
statement.

But how would we prove an existentially quantified statement? What 
technique would we employ to prove a theorem of the following form? 


∃x, R(x)


This statement asserts that there exists some specific object x for which
R(x) is true. To prove ∃x, R(x) is true, all we would have to do is find and
display an example of a specific x that makes R(x) true.

Though most theorems and propositions are conditional (or if-and-only-if)
statements, a few have the form ∃x, R(x). Such statements are
called existence statements, and theorems that have this form are called 
existence theorems. To prove an existence theorem, all you have to do 
is provide a particular example that shows it is true. This is often quite 
simple. (But not always!) Here are some examples: 


Proposition There exists an even prime number. 


Proof. Observe that 2 is an even prime number. ∎


Admittedly, this last proposition was a bit of an oversimplification. The 
next one is slightly more challenging. 


Proposition There exists an integer that can be expressed as the sum of 
two perfect cubes in two different ways. 


Proof. Consider the number 1729. Note that 1^3 + 12^3 = 1729 and 9^3 + 10^3 =
1729. Thus the number 1729 can be expressed as the sum of two perfect
cubes in two different ways. ∎


Sometimes in the proof of an existence statement, a little verification is 
needed to show that the example really does work. For example, the above 
proof would be incomplete if we just asserted that 1729 can be written as a 
sum of two cubes in two ways without showing how this is possible. 
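
The verification step can be mechanized. The sketch below is an illustration only: it assumes we restrict attention to sums of two positive cubes, and the function name two_cube_sums and the search limit of 2000 are choices made for the example. It confirms the two representations of 1729 and searches for every witness up to that limit.

```python
from collections import defaultdict
from itertools import combinations_with_replacement


def two_cube_sums(limit: int):
    """Map each n <= limit to the pairs (a, b), 1 <= a <= b, with a^3 + b^3 = n."""
    bound = round(limit ** (1 / 3)) + 1  # a safe upper bound on a and b
    sums = defaultdict(list)
    for a, b in combinations_with_replacement(range(1, bound + 1), 2):
        n = a**3 + b**3
        if n <= limit:
            sums[n].append((a, b))
    return sums


sums = two_cube_sums(2000)
# Integers with at least two different representations are witnesses.
witnesses = sorted(n for n, pairs in sums.items() if len(pairs) >= 2)
print(sums[1729])
print(witnesses)
```

A search like this finds an example; the proof itself only needs the example to be displayed and checked.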


WARNING: Although an example suffices to prove an existence statement, 
a single example does not prove a conditional statement. 




Often an existence statement will be embedded inside of a conditional 
statement. Consider the following. (Recall the definition of gcd on page 116.) 


If a, b ∈ ℕ, then there exist integers k and ℓ for which gcd(a, b) = ak + bℓ.


This is a conditional statement that has the form

a, b ∈ ℕ ⇒ ∃ k, ℓ ∈ ℤ, gcd(a, b) = ak + bℓ.


To prove it with direct proof, we would first assume that a, b ∈ ℕ, then
prove the existence statement ∃ k, ℓ ∈ ℤ, gcd(a, b) = ak + bℓ. That is, we
would produce two integers k and ℓ (which depend on a and b) for which
gcd(a, b) = ak + bℓ. Let’s carry out this plan. (We will use this fundamental
proposition several times later, so it is given a number.)


Proposition 7.1 If a, b ∈ ℕ, then there exist integers k and ℓ for which
gcd(a, b) = ak + bℓ.


Proof. (Direct) Suppose a, b ∈ ℕ. Consider the set A = {ax + by : x, y ∈ ℤ}.
This set contains both positive and negative integers, as well as 0. (Reason:
Let y = 0 and let x range over all integers. Then ax + by = ax ranges over
all multiples of a: positive, negative and zero.) Let d be the smallest
positive element of A. Then, because d is in A, it must have the form
d = ak + bℓ for some specific k, ℓ ∈ ℤ.

To finish, we will show d = gcd(a, b). We will first argue that d is a
common divisor of a and b, and then that it is the greatest common divisor.

To see that d | a, use the division algorithm (page 30) to write a = qd + r
for integers q and r with 0 ≤ r < d. The equation a = qd + r yields


r = a - qd
  = a - q(ak + bℓ)
  = a(1 - qk) + b(-qℓ).


Therefore r has form r = ax + by, so it belongs to A. But 0 ≤ r < d and d is the
smallest positive number in A, so r can’t be positive; hence r = 0. Updating
our equation a = qd + r, we get a = qd, so d | a. Repeating this argument
with b = qd + r shows d | b. Thus d is indeed a common divisor of a and b. It
remains to show that it is the greatest common divisor.

As gcd(a, b) divides a and b, we have a = gcd(a, b) · m and b = gcd(a, b) · n for
some m, n ∈ ℤ. So d = ak + bℓ = gcd(a, b) · mk + gcd(a, b) · nℓ = gcd(a, b)(mk + nℓ),
and thus d is a multiple of gcd(a, b). Since d is positive, d ≥ gcd(a, b). But d can’t be
a larger common divisor of a and b than gcd(a, b), so d = gcd(a, b). ∎
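
The proof above shows that suitable integers k and ℓ exist but does not say how to compute them. One standard way to find such integers is the extended Euclidean algorithm, which is a different technique from the smallest-element argument used in the proof. The sketch below is an illustration under that assumption; the function name bezout and the sample inputs are hypothetical.

```python
from math import gcd


def bezout(a: int, b: int):
    """Return (g, k, l) with g = gcd(a, b) = a*k + b*l, for natural numbers a, b.

    Invariant kept by the loop: old_r = a*old_k + b*old_l and r = a*k + b*l.
    """
    old_r, r = a, b
    old_k, k = 1, 0
    old_l, l = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_k, k = k, old_k - q * k
        old_l, l = l, old_l - q * l
    return old_r, old_k, old_l


g, k, l = bezout(21, 30)
print(g, k, l)
print(21 * k + 30 * l == gcd(21, 30))
```

The pair (k, ℓ) is not unique, so another method may return different integers that satisfy the same equation.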




We conclude this section with a discussion of so-called uniqueness proofs. 
Some existence statements have form “There is a unique x for which P(x).” 
Such a statement asserts that there is exactly one example x for which P(x) is 
true. To prove it, you must produce an example x =d for which P(d) is true, 
and you must show that d is the only such example. The next proposition 
illustrates this. In essence, it asserts that the set {ax + by : x, y ∈ ℤ} consists
precisely of all the multiples of gcd(a, b).


Proposition Suppose a, b ∈ ℕ. Then there exists a unique d ∈ ℕ for which:
An integer m is a multiple of d if and only if m = ax + by for some x, y ∈ ℤ.


Proof. Suppose a, b ∈ ℕ. Let d = gcd(a, b). We first show that an integer m is
a multiple of d if and only if m = ax + by for some x, y ∈ ℤ. Let m = dn be a
multiple of d. By Proposition 7.1, there are integers
k and ℓ for which d = ak + bℓ. Then m = dn = (ak + bℓ)n = a(kn) + b(ℓn), so
m = ax + by for integers x = kn and y = ℓn.

Conversely, suppose m = ax + by for some x, y ∈ ℤ. Since d = gcd(a, b) is
a divisor of both a and b, we have a = dc and b = de for some c, e ∈ ℤ. Then
m = ax + by = dcx + dey = d(cx + ey), and this is a multiple of d.

We have now shown that there is a natural number d with the property
that m is a multiple of d if and only if m = ax + by for some x, y ∈ ℤ. It remains
to show that d is the unique such natural number. To do this, suppose d' is
any natural number with the property that d has:


m is a multiple of d' ⇔ m = ax + by for some x, y ∈ ℤ.    (7.1)


We next argue that d' = d; that is, d is the unique natural number with
the stated property. Because of (7.1), m = a · 1 + b · 0 = a is a multiple of d'.
Likewise m = a · 0 + b · 1 = b is a multiple of d'. Hence a and b are both
multiples of d', so d' is a common divisor of a and b, and therefore


d' ≤ gcd(a, b) = d.


But also, by (7.1), the multiple m = d' · 1 = d' of d' can be expressed as
d' = ax + by for some x, y ∈ ℤ. As noted in the second paragraph of the proof,
a = dc and b = de for some c, e ∈ ℤ. Thus d' = ax + by = dcx + dey = d(cx + ey),
so d' is a multiple of d. As d' and d are both positive, it follows that


d ≤ d'.


We’ve now shown that d' ≤ d and d ≤ d', so d = d'. The proof is complete. ∎
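
Both halves of this proposition can be illustrated on a bounded window of integers. In the sketch below the values a = 12, b = 18, the coefficient bound 20 and the window 60 are arbitrary choices for the example, and the helper names combos and multiples are hypothetical. A finite check like this illustrates the statement; it does not replace the proof.

```python
from math import gcd


def combos(a: int, b: int, bound: int) -> set:
    """All values ax + by with |x| <= bound and |y| <= bound."""
    span = range(-bound, bound + 1)
    return {a * x + b * y for x in span for y in span}


def multiples(d: int, limit: int) -> set:
    """All multiples m of d with |m| <= limit."""
    return {m for m in range(-limit, limit + 1) if m % d == 0}


a, b = 12, 18
window = 60
d = gcd(a, b)

# Existence: inside the window, the combinations are exactly the multiples of d.
seen = {m for m in combos(a, b, 20) if abs(m) <= window}
print(seen == multiples(d, window))

# Uniqueness: no other natural number up to the window has the same multiples.
candidates = [c for c in range(1, window + 1) if multiples(c, window) == seen]
print(candidates)
```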




7.4 Constructive Versus Non-Constructive Proofs 


Existence proofs fall into two categories: constructive and non-constructive. 
Constructive proofs display an explicit example that proves the theorem; 
non-constructive proofs prove an example exists without actually giving it. 
We illustrate the difference with two proofs of the same fact: There exist 
irrational numbers x and y (possibly equal) for which x^y is rational.


Proposition There exist irrational numbers x, y for which x^y is rational.


Proof. Let x = √2^√2 and y = √2. We know y is irrational, but it is not clear
whether x is rational or irrational. On one hand, if x is irrational, then we
have an irrational number to an irrational power that is rational:


x^y = (√2^√2)^√2 = √2^(√2 · √2) = √2^2 = 2.


On the other hand, if x is rational, then y^y = √2^√2 = x is rational. Either way,
we have an irrational number to an irrational power that is rational. ∎


The above is a classic example of a non-constructive proof. It shows
that there exist irrational numbers x and y for which x^y is rational without
actually producing (or constructing) an example. It convinces us that one
of (√2^√2)^√2 or √2^√2 is an irrational number to an irrational power that is
rational, but it does not say which one is the correct example. It thus proves
that an example exists without explicitly stating one.

Next comes a constructive proof of this statement, one that produces
(or constructs) two explicit irrational numbers x, y for which x^y is rational.


Proposition There exist irrational numbers x, y for which x^y is rational.


Proof. Let x = √2 and y = log_2 9. Then


x^y = √2^(log_2 9) = √2^(log_2 3^2) = √2^(2 log_2 3) = (√2^2)^(log_2 3) = 2^(log_2 3) = 3.


As 3 is rational, we have shown that x^y = 3 is rational.

We know that x = √2 is irrational. The proof will be complete if we can
show that y = log_2 9 is irrational. Suppose for the sake of contradiction that
log_2 9 is rational, so there are positive integers a and b for which a/b = log_2 9.
This means 2^(a/b) = 9, so (2^(a/b))^b = 9^b, which reduces to 2^a = 9^b. But 2^a is even,
while 9^b is odd (because it is the product of the odd number 9 with itself b
times). This is a contradiction; the proof is complete. ∎




This existence proof has inside of it a separate proof (by contradiction) 
that log_2 9 is irrational. Such combinations of proof techniques are, of course,
typical. (Some careful writers would even insert a proof that 9^b is odd.)

Be alert to constructive and non-constructive proofs as you read proofs 
in other books and articles, as well as to the possibility of crafting such 
proofs of your own. 
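
The two computations in this section can be illustrated numerically. The sketch below uses floating-point arithmetic, so it can only show that the values are close to 3 and 2; it cannot establish that any number is rational or irrational. The variable names and tolerance are choices made for the example.

```python
import math

# Constructive example: x = sqrt(2), y = log_2(9), and x^y should be near 3.
x = math.sqrt(2)
y = math.log2(9)
constructive_value = x ** y
print(constructive_value)

# Non-constructive example: (sqrt(2)^sqrt(2))^sqrt(2) should be near 2.
root_power = math.sqrt(2) ** math.sqrt(2)
nonconstructive_value = root_power ** math.sqrt(2)
print(nonconstructive_value)

# Compare with a tolerance, since floating-point results are approximate.
print(math.isclose(constructive_value, 3.0, rel_tol=1e-9))
print(math.isclose(nonconstructive_value, 2.0, rel_tol=1e-9))
```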


Exercises for Chapter 7 

Prove the following statements. These exercises are cumulative, covering all
techniques addressed in Chapters 4–7.

1. Suppose x ∈ ℤ. Then x is even if and only if 3x + 5 is odd.
2. Suppose x ∈ ℤ. Then x is odd if and only if 3x + 6 is odd.
3. Given an integer a, then a^3 + a^2 + a is even if and only if a is even.
4. Given an integer a, then a^2 + 4a + 5 is odd if and only if a is even.
5. An integer a is odd if and only if a^3 is odd.
6. Suppose x, y ∈ ℝ. Then x^3 + x^2 y = y^2 + xy if and only if y = x^2 or y = -x.
7. Suppose x, y ∈ ℝ. Then (x + y)^2 = x^2 + y^2 if and only if x = 0 or y = 0.
8. Suppose a, b ∈ ℤ. Prove that a ≡ b (mod 10) if and only if a ≡ b (mod 2) and
   a ≡ b (mod 5).
9. Suppose a ∈ ℤ. Prove that 14 | a if and only if 7 | a and 2 | a.
10. If a ∈ ℤ, then a^3 ≡ a (mod 3).
11. Suppose a, b ∈ ℤ. Prove that (a - 3)b^2 is even if and only if a is odd or b is even.
12. There exists a positive real number x for which x^2 < √x.
13. Suppose a, b ∈ ℤ. If a + b is odd, then a^2 + b^2 is odd.
14. Suppose a ∈ ℤ. Then a^2 | a if and only if a ∈ {-1, 0, 1}.
15. Suppose a, b ∈ ℤ. Prove that a + b is even if and only if a and b have the same
    parity.
16. Suppose a, b ∈ ℤ. If ab is odd, then a^2 + b^2 is even.
17. There is a prime number between 90 and 100.
18. There is a set X for which ℕ ∈ X and ℕ ⊆ X.
19. If n ∈ ℕ, then 2^0 + 2^1 + 2^2 + 2^3 + 2^4 + ··· + 2^n = 2^(n+1) - 1.
20. There exists an n ∈ ℕ for which 11 | (2^n - 1).
21. Every real solution of x^3 + x + 3 = 0 is irrational.
22. If n ∈ ℤ, then 4 | n^2 or 4 | (n^2 - 1).
23. Suppose a, b and c are integers. If a | b and a | (b^2 - c), then a | c.
24. If a ∈ ℤ, then 4 ∤ (a^2 - 3).
25. If p > 1 is an integer and n ∤ p for each integer n for which 2 ≤ n ≤ √p, then p is
    prime.
26. The product of any n consecutive positive integers is divisible by n!.
27. Suppose a, b ∈ ℤ. If a^2 + b^2 is a perfect square, then a and b are not both odd.
28. Prove the division algorithm: If a, b ∈ ℕ, there exist unique integers q, r for
    which a = bq + r, and 0 ≤ r < b. (A proof of existence is given in Section 1.9, but
    uniqueness needs to be established too.)
29. If a | bc and gcd(a, b) = 1, then a | c.
    (Suggestion: Use the proposition on page 152.)
30. Suppose a, b, p ∈ ℤ and p is prime. Prove that if p | ab then p | a or p | b.
    (Suggestion: Use the proposition on page 152.)
31. If n ∈ ℤ, then gcd(n, n + 1) = 1.
32. If n ∈ ℤ, then gcd(n, n + 2) ∈ {1, 2}.
33. If n ∈ ℤ, then gcd(2n + 1, 4n^2 + 1) = 1.
34. If gcd(a, c) = gcd(b, c) = 1, then gcd(ab, c) = 1.
    (Suggestion: Use the proposition on page 152.)
35. Suppose a, b ∈ ℕ. Then a = gcd(a, b) if and only if a | b.
36. Suppose a, b ∈ ℕ. Then a = lcm(a, b) if and only if b | a.




CHAPTER 8 


Proofs Involving Sets 


Students in their first advanced mathematics classes are often surprised
by the extensive role that sets play and by the fact that most of the 
proofs they encounter are proofs about sets. Perhaps you’ve already seen 
such proofs in your linear algebra course, where a vector space was defined 
to be a set of objects (called vectors) that obey certain properties. Your 
text proved many things about vector spaces, such as the fact that the 
intersection of two vector spaces is also a vector space, and the proofs used 
ideas from set theory. As you go deeper into mathematics, you will encounter 
more and more ideas, theorems and proofs that involve sets. The purpose 
of this chapter is to give you a foundation that will prepare you for this new 
outlook. 

We will discuss how to show that an object is an element of a set, how to 
prove one set is a subset of another and how to prove two sets are equal. As 
you read this chapter you may need to occasionally refer back to Chapter 1 
to refresh your memory. For your convenience, the main definitions from 
Chapter 1 are summarized below. If A and B are sets, then 


A × B = {(x, y) : x ∈ A, y ∈ B},
A ∪ B = {x : (x ∈ A) ∨ (x ∈ B)},
A ∩ B = {x : (x ∈ A) ∧ (x ∈ B)},
A - B = {x : (x ∈ A) ∧ (x ∉ B)},
Ā = U - A.


Recall that A ⊆ B means that every element of A is also an element of B.
Also, the power set of A is the set of all subsets of A:


𝒫(A) = {X : X ⊆ A}.


8.1 How to Prove a ∈ A


We will begin with a review of set-builder notation, and then review how to 
show that a given object a is an element of some set A. 




Generally, a set A will be expressed in set-builder notation A = {x : P(x)}, 
where P(x) is some open sentence about x. The set A is understood to have 
as elements all those things x for which P(x) is true. For example, 


{x : x is an odd integer} = {..., -5, -3, -1, 1, 3, 5, ...}.


A common variation of this notation is to express a set as A = {x ∈ S : P(x)}.
Here it is understood that A consists of all elements x of the (predetermined) 
set S for which P(x) is true. Keep in mind that, depending on context, x 
could be any kind of object (integer, ordered pair, set, function, etc.). There is 
also nothing special about the particular variable x; any reasonable symbol 
x, y, k, etc., would do. Some examples follow. 


{n ∈ ℤ : n is odd} = {..., -5, -3, -1, 1, 3, 5, ...}
{x ∈ ℕ : 6 | x} = {6, 12, 18, 24, 30, ...}
{(a, b) ∈ ℤ × ℤ : b = a + 5} = {..., (-2, 3), (-1, 4), (0, 5), (1, 6), ...}
{X ∈ 𝒫(ℤ) : |X| = 1} = {..., {-1}, {0}, {1}, {2}, {3}, {4}, ...}


Now it should be clear how to prove that an object a belongs to a set {x : P(x)}.
Since {x : P(x)} consists of all things x for which P(x) is true, to show that
a ∈ {x : P(x)} we just need to show that P(a) is true. Likewise, to show
a ∈ {x ∈ S : P(x)}, we need to confirm that a ∈ S and that P(a) is true. These
ideas are summarized below. However, you should not memorize these
methods; you should understand them. With contemplation and practice,
using them becomes natural and intuitive.


How to show a ∈ {x : P(x)}
Show that P(a) is true.

How to show a ∈ {x ∈ S : P(x)}
1. Verify that a ∈ S.
2. Show that P(a) is true.


Example 8.1 Let’s investigate elements of A = {x : x ∈ ℕ and 7 | x}. This set
has form A = {x : P(x)} where P(x) is the open sentence (x ∈ ℕ) ∧ (7 | x). Thus
21 ∈ A because P(21) is true. Similarly, 7, 14, 28, 35, etc., are all elements of
A. But 8 ∉ A (for example) because P(8) is false. Likewise -14 ∉ A because
P(-14) is false.

Example 8.2 Consider the set A = {X ∈ 𝒫(ℕ) : |X| = 3}. We know that
{4, 13, 45} ∈ A because {4, 13, 45} ∈ 𝒫(ℕ) and |{4, 13, 45}| = 3. Also {1, 2, 3} ∈ A,
{10, 854, 3} ∈ A, etc. However {1, 2, 3, 4} ∉ A because |{1, 2, 3, 4}| ≠ 3. Further,
{-1, 2, 3} ∉ A because {-1, 2, 3} ∉ 𝒫(ℕ).




Example 8.3 Consider the set B = {(x, y) ∈ ℤ × ℤ : x ≡ y (mod 5)}. Notice
(8, 23) ∈ B because (8, 23) ∈ ℤ × ℤ and 8 ≡ 23 (mod 5). Likewise, (100, 75) ∈ B,
(102, 77) ∈ B, etc., but (6, 10) ∉ B.

Now suppose n ∈ ℤ and consider the ordered pair (4n + 3, 9n - 2). Does this
ordered pair belong to B? To answer this, we first observe that
(4n + 3, 9n - 2) ∈ ℤ × ℤ. Next, we observe that (4n + 3) - (9n - 2) = -5n + 5 = 5(1 - n),
so 5 | ((4n + 3) - (9n - 2)), which means (4n + 3) ≡ (9n - 2) (mod 5). Therefore we
have established that (4n + 3, 9n - 2) meets the requirements for belonging
to B, so (4n + 3, 9n - 2) ∈ B for every n ∈ ℤ.


Example 8.4 This illustrates another common way of defining a set.
Consider the set C = {3x^3 + 2 : x ∈ ℤ}. Elements of this set consist of all the
values 3x^3 + 2 where x is an integer. Thus -22 ∈ C because -22 = 3(-2)^3 + 2.
You can confirm -1 ∈ C and 5 ∈ C, etc. Also 0 ∉ C and 1/2 ∉ C, etc.
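
The rule "to show a ∈ {x ∈ S : P(x)}, verify a ∈ S and then P(a)" translates directly into a membership test. The sketch below is an illustration, assuming ℕ = {1, 2, 3, ...}; the function names in_A and in_B are hypothetical and model the sets A of Example 8.1 and B of Example 8.3.

```python
def in_A(x) -> bool:
    """P(x) for A = {x : x in N and 7 | x}, taking N = {1, 2, 3, ...}."""
    return isinstance(x, int) and x >= 1 and x % 7 == 0


def in_B(pair) -> bool:
    """Membership test for B = {(x, y) in Z x Z : x ≡ y (mod 5)}."""
    x, y = pair
    # Step 1: verify the pair lies in Z x Z.  Step 2: check the open sentence.
    return isinstance(x, int) and isinstance(y, int) and (x - y) % 5 == 0


print(in_A(21), in_A(8), in_A(-14))
print(in_B((8, 23)), in_B((6, 10)))

# A finite spot check of the claim about (4n + 3, 9n - 2); the text proves it for all n.
print(all(in_B((4 * n + 3, 9 * n - 2)) for n in range(-100, 101)))
```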


8.2 How to Prove A ⊆ B


In this course (and more importantly, beyond it) you will encounter many 
circumstances where it is necessary to prove that one set is a subset of
another. This section explains how to do this. The methods we discuss should
improve your skills in both writing your own proofs and in comprehending 
the proofs that you read. 

Recall (Definition 1.3) that if A and B are sets, then A ⊆ B means that
every element of A is also an element of B. In other words, it means if a ∈ A,
then a ∈ B. Therefore to prove that A ⊆ B, we just need to prove that the
conditional statement


“If a ∈ A, then a ∈ B”


is true. This can be proved directly, by assuming a ∈ A and deducing a ∈ B.
The contrapositive approach is another option: Assume a ∉ B and deduce
a ∉ A. Each of these two approaches is outlined below.


How to Prove A ⊆ B
(Direct approach)
Proof. Suppose a ∈ A.
  ...
Therefore a ∈ B.
Thus a ∈ A implies a ∈ B,
so it follows that A ⊆ B. ∎

How to Prove A ⊆ B
(Contrapositive approach)
Proof. Suppose a ∉ B.
  ...
Therefore a ∉ A.
Thus a ∉ B implies a ∉ A,
so it follows that A ⊆ B. ∎




In practice, the direct approach usually yields the most straightforward 
and easy proof, though occasionally the contrapositive is the most expedient. 
(You can even prove A ⊆ B by contradiction: Assume (a ∈ A) ∧ (a ∉ B), and
deduce a contradiction.) The remainder of this section consists of examples 
with occasional commentary. Unless stated otherwise, we will use the direct 
approach in all proofs; pay special attention to how the above outline for 
the direct approach is used. 


Example 8.5 Prove that {x ∈ ℤ : 18 | x} ⊆ {x ∈ ℤ : 6 | x}.


Proof. Suppose a ∈ {x ∈ ℤ : 18 | x}.
This means that a ∈ ℤ and 18 | a.
By definition of divisibility, there is an integer c for which a = 18c.
Consequently a = 6(3c), and from this we deduce that 6 | a.
Therefore a is one of the integers that 6 divides, so a ∈ {x ∈ ℤ : 6 | x}.

We’ve shown a ∈ {x ∈ ℤ : 18 | x} implies a ∈ {x ∈ ℤ : 6 | x}, so it follows that
{x ∈ ℤ : 18 | x} ⊆ {x ∈ ℤ : 6 | x}. ∎


Example 8.6 Prove that {x ∈ ℤ : 2 | x} ∩ {x ∈ ℤ : 9 | x} ⊆ {x ∈ ℤ : 6 | x}.


Proof. Suppose a ∈ {x ∈ ℤ : 2 | x} ∩ {x ∈ ℤ : 9 | x}.
By definition of intersection, this means a ∈ {x ∈ ℤ : 2 | x} and a ∈ {x ∈ ℤ : 9 | x}.
Since a ∈ {x ∈ ℤ : 2 | x} we know 2 | a, so a = 2c for some c ∈ ℤ. Thus a is even.
Since a ∈ {x ∈ ℤ : 9 | x} we know 9 | a, so a = 9d for some d ∈ ℤ.
As a is even, a = 9d implies d is even. (Otherwise a = 9d would be odd.)
Then d = 2e for some integer e, and we have a = 9d = 9(2e) = 6(3e).
From a = 6(3e), we conclude 6 | a, and this means a ∈ {x ∈ ℤ : 6 | x}.

We have shown that a ∈ {x ∈ ℤ : 2 | x} ∩ {x ∈ ℤ : 9 | x} implies a ∈ {x ∈ ℤ : 6 | x},
so it follows that {x ∈ ℤ : 2 | x} ∩ {x ∈ ℤ : 9 | x} ⊆ {x ∈ ℤ : 6 | x}. ∎


Example 8.7 Show {(x, y) ∈ ℤ × ℤ : x ≡ y (mod 6)} ⊆ {(x, y) ∈ ℤ × ℤ : x ≡ y (mod 3)}.


Proof. Suppose (a, b) ∈ {(x, y) ∈ ℤ × ℤ : x ≡ y (mod 6)}.
This means (a, b) ∈ ℤ × ℤ and a ≡ b (mod 6).
Consequently 6 | (a - b), so a - b = 6c for some integer c.
It follows that a - b = 3(2c), and this means 3 | (a - b), so a ≡ b (mod 3).
Thus (a, b) ∈ {(x, y) ∈ ℤ × ℤ : x ≡ y (mod 3)}.
We’ve now seen that (a, b) ∈ {(x, y) ∈ ℤ × ℤ : x ≡ y (mod 6)} implies (a, b) ∈
{(x, y) ∈ ℤ × ℤ : x ≡ y (mod 3)}, so it follows that {(x, y) ∈ ℤ × ℤ : x ≡ y (mod 6)} ⊆
{(x, y) ∈ ℤ × ℤ : x ≡ y (mod 3)}. ∎
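
Subset claims like those in Examples 8.5 and 8.6 can be spot-checked on a finite window of integers with Python's built-in set operations. This is a sketch under the assumption that a window of -500 to 500 is representative; the helper name multiples_in_window is hypothetical. A passing check is evidence for the claim, while a failing check is a genuine counterexample.

```python
def multiples_in_window(d: int, bound: int) -> set:
    """The integers x with |x| <= bound that d divides."""
    return {x for x in range(-bound, bound + 1) if x % d == 0}


BOUND = 500
m2, m6, m9, m18 = (multiples_in_window(d, BOUND) for d in (2, 6, 9, 18))

print(m18 <= m6)         # Example 8.5 on the window (<= tests "is a subset of")
print((m2 & m9) <= m6)   # Example 8.6 on the window (& is intersection)
print(m6 <= m18)         # The reverse inclusion of Example 8.5
print(min(x for x in m6 - m18 if x > 0))
```

The last two lines show why the reverse inclusion fails: the set difference exposes an element of the larger set that is missing from the smaller one.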




Some statements involving subsets are transparent enough that we 
often accept (and use) them without proof. For example, if A and B are any 
sets, then it’s very easy to confirm A ∩ B ⊆ A. (Reason: Suppose x ∈ A ∩ B.
Then x ∈ A and x ∈ B by definition of intersection, so in particular x ∈ A.
Thus x ∈ A ∩ B implies x ∈ A, so A ∩ B ⊆ A.) Other statements of this nature
include A ⊆ A ∪ B and A - B ⊆ A, as well as conditional statements such as
((A ⊆ B) ∧ (B ⊆ C)) ⇒ (A ⊆ C) and (X ⊆ A) ⇒ (X ⊆ A ∪ B). Our point of view in
this text is that we do not need to prove such obvious statements unless
we are explicitly asked to do so in an exercise. (Still, you should do some
quick mental proofs to convince yourself that the above statements are true.
If you don’t see that A ∩ B ⊆ A is true but that A ⊆ A ∩ B is not necessarily
true, then you need to spend more time on this topic.)

The next example will show that if A and B are sets, then 𝒫(A) ∪ 𝒫(B) ⊆
𝒫(A ∪ B). Before beginning our proof, let’s look at an example to see if this
statement really makes sense. Suppose A = {1, 2} and B = {2, 3}. Then


𝒫(A) ∪ 𝒫(B) = {∅, {1}, {2}, {1, 2}} ∪ {∅, {2}, {3}, {2, 3}}
            = {∅, {1}, {2}, {3}, {1, 2}, {2, 3}}.


Also 𝒫(A ∪ B) = 𝒫({1, 2, 3}) = {∅, {1}, {2}, {3}, {1, 2}, {2, 3}, {1, 3}, {1, 2, 3}}. Thus,
even though 𝒫(A) ∪ 𝒫(B) ≠ 𝒫(A ∪ B), it is true that 𝒫(A) ∪ 𝒫(B) ⊆ 𝒫(A ∪ B)
for this particular A and B. Now let’s prove 𝒫(A) ∪ 𝒫(B) ⊆ 𝒫(A ∪ B) no
matter what sets A and B are.


Example 8.8 Prove that if A and B are sets, then 𝒫(A) ∪ 𝒫(B) ⊆ 𝒫(A ∪ B).


Proof. Suppose X ∈ 𝒫(A) ∪ 𝒫(B).
By definition of union, this means X ∈ 𝒫(A) or X ∈ 𝒫(B).
Therefore X ⊆ A or X ⊆ B (by definition of power sets). We consider cases.
Case 1. Suppose X ⊆ A. Then X ⊆ A ∪ B, and this means X ∈ 𝒫(A ∪ B).
Case 2. Suppose X ⊆ B. Then X ⊆ A ∪ B, and this means X ∈ 𝒫(A ∪ B).
(We do not need to consider the case where X ⊆ A and X ⊆ B because that
is taken care of by either of cases 1 or 2.) The above cases show that
X ∈ 𝒫(A ∪ B).

Thus we’ve shown that X ∈ 𝒫(A) ∪ 𝒫(B) implies X ∈ 𝒫(A ∪ B), and this
completes the proof that 𝒫(A) ∪ 𝒫(B) ⊆ 𝒫(A ∪ B). ∎
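
For small sets, the power sets can be computed outright and the inclusion checked by machine. The sketch below is an illustration for the particular sets A = {1, 2} and B = {2, 3}; the helper name power_set is hypothetical, and frozenset is used because ordinary Python sets cannot be elements of other sets.

```python
from itertools import chain, combinations


def power_set(s):
    """Return the power set of s as a set of frozensets."""
    items = list(s)
    subsets = chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1)
    )
    return {frozenset(sub) for sub in subsets}


A, B = {1, 2}, {2, 3}
left = power_set(A) | power_set(B)   # P(A) ∪ P(B)
right = power_set(A | B)             # P(A ∪ B)

print(len(left), len(right))
print(left <= right)
# Subsets of A ∪ B that lie in neither P(A) nor P(B):
print(sorted(sorted(x) for x in right - left))
```

The subsets reported on the last line are exactly the ones that use elements from both A - B and B - A, which is why the inclusion can be proper.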


In our next example, we prove a conditional statement. Direct proof is 
used, and in the process we use our new technique for showing A ⊆ B.




Example 8.9 Suppose A and B are sets. If 𝒫(A) ⊆ 𝒫(B), then A ⊆ B.


Proof. We use direct proof. Assume 𝒫(A) ⊆ 𝒫(B).
Based on this assumption, we must now show that A ⊆ B.
To show A ⊆ B, suppose that a ∈ A.
Then the one-element set {a} is a subset of A, so {a} ∈ 𝒫(A).
But then, since 𝒫(A) ⊆ 𝒫(B), it follows that {a} ∈ 𝒫(B).
This means that {a} ⊆ B, hence a ∈ B.
We’ve shown that a ∈ A implies a ∈ B, so A ⊆ B. ∎


8.3 How to Prove A=B 


In proofs it is often necessary to show that two sets are equal. There is a 
standard way of doing this. Suppose we want to show A = B. If we show
A ⊆ B, then every element of A is also in B, but there is still a possibility
that B could have some elements that are not in A, so we can’t conclude
A = B. But if in addition we also show B ⊆ A, then B can’t contain anything
that is not in A, so A = B. This is the standard procedure for proving A = B:
Prove both A ⊆ B and B ⊆ A.


How to Prove A=B 


Proof. 
[Prove that A ⊆ B.]
[Prove that B ⊆ A.]

Therefore, since A ⊆ B and B ⊆ A,
it follows that A = B. ∎


Example 8.10 Prove that {n ∈ ℤ : 35 | n} = {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n}.

Proof. First we show {n ∈ ℤ : 35 | n} ⊆ {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n}. Suppose
a ∈ {n ∈ ℤ : 35 | n}. This means 35 | a, so a = 35c for some c ∈ ℤ. Thus a = 5(7c)
and a = 7(5c). From a = 5(7c) it follows that 5 | a, so a ∈ {n ∈ ℤ : 5 | n}. From
a = 7(5c) it follows that 7 | a, which means a ∈ {n ∈ ℤ : 7 | n}. As a belongs to
both {n ∈ ℤ : 5 | n} and {n ∈ ℤ : 7 | n}, we get a ∈ {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n}. Thus
we’ve shown that {n ∈ ℤ : 35 | n} ⊆ {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n}.

Next we show {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n} ⊆ {n ∈ ℤ : 35 | n}. Suppose that
a ∈ {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n}. By definition of intersection, this means that
a ∈ {n ∈ ℤ : 5 | n} and a ∈ {n ∈ ℤ : 7 | n}. Therefore it follows that 5 | a and 7 | a.
By definition of divisibility, there are integers c and d with a = 5c and a = 7d.
Then a has both 5 and 7 as prime factors, so the prime factorization of a
must include factors of 5 and 7. Hence 5·7 = 35 divides a, so a ∈ {n ∈ ℤ : 35 | n}.
We’ve now shown that {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n} ⊆ {n ∈ ℤ : 35 | n}.

At this point we’ve shown that {n ∈ ℤ : 35 | n} ⊆ {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n}
and {n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n} ⊆ {n ∈ ℤ : 35 | n}, so we’ve proved {n ∈ ℤ : 35 | n} =
{n ∈ ℤ : 5 | n} ∩ {n ∈ ℤ : 7 | n}. ∎
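The two-inclusion pattern of this proof can be mirrored on a computer, at least on a finite window of integers. The sketch below is an illustration under that assumption (the range from −500 to 500 stands in for ℤ, and the name divisible_by is ours); it checks each inclusion separately, just as the proof does.

```python
def divisible_by(d, numbers):
    """Return the set of n in numbers with d | n."""
    return {n for n in numbers if n % d == 0}

window = range(-500, 501)  # assumption: a finite stand-in for the integers

left = divisible_by(35, window)
right = divisible_by(5, window) & divisible_by(7, window)

left_in_right = left <= right   # mirrors the first paragraph of the proof
right_in_left = right <= left   # mirrors the second paragraph
print(left_in_right, right_in_left, left == right)
```

Agreement on a finite window is evidence, not proof. The proof above is what covers every integer.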


You know from algebra that if c ≠ 0 and ac = bc, then a = b. The next
example shows that an analogous statement holds for sets A, B and C. The
example asks us to prove a conditional statement. We will prove it with 
direct proof. In carrying out the process of direct proof, we will have to use 
the new techniques from this section. 


Example 8.11 Suppose A, B, and C are sets, and C ≠ ∅. Prove that if
A × C = B × C, then A = B.

Proof. Suppose A × C = B × C. We must now show A = B.

First we will show A ⊆ B. Suppose a ∈ A. Since C ≠ ∅, there exists
an element c ∈ C. Thus, since a ∈ A and c ∈ C, we have (a,c) ∈ A × C, by
definition of the Cartesian product. But then, since A × C = B × C, it follows
that (a,c) ∈ B × C. But (a,c) ∈ B × C means a ∈ B, by definition of the Cartesian
product. We have shown a ∈ A implies a ∈ B, so A ⊆ B.

Next we show B ⊆ A. We use the same argument as above, with the roles
of A and B reversed. Suppose a ∈ B. Since C ≠ ∅, there exists an element
c ∈ C. Thus, since a ∈ B and c ∈ C, we have (a,c) ∈ B × C. But then, since
B × C = A × C, we have (a,c) ∈ A × C. It follows that a ∈ A. We have shown
a ∈ B implies a ∈ A, so B ⊆ A.

The previous two paragraphs have shown A ⊆ B and B ⊆ A, so A = B. In
summary, we have shown that if A × C = B × C, then A = B. This completes
the proof. ∎
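The hypothesis C ≠ ∅ was used twice in the proof, and it cannot be dropped. The short Python sketch below (an illustration only; the helper cartesian and the particular sets are our own choices) shows that when C is empty, A × C and B × C are both empty even though A ≠ B.

```python
from itertools import product

def cartesian(X, Y):
    """Return the Cartesian product X × Y as a set of ordered pairs."""
    return set(product(X, Y))

A = {1, 2}
B = {1, 2, 3}

# With a nonempty C, the different sets A and B give different products.
C = {"c"}
products_differ = cartesian(A, C) != cartesian(B, C)

# With C empty, both products are empty, so A × C = B × C although A ≠ B.
empty = set()
products_agree_when_C_empty = cartesian(A, empty) == cartesian(B, empty)
print(products_differ, products_agree_when_C_empty)
```

This is the set analogue of the fact that ac = bc does not force a = b when c = 0.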


Now we’ll look at another way that set operations are similar to operations
on numbers. From algebra you are familiar with the distributive
property a·(b + c) = a·b + a·c. Replace the numbers a, b, c with sets A, B, C,
and replace · with × and + with ∩. We get A × (B ∩ C) = (A × B) ∩ (A × C). This
statement turns out to be true, as we now prove.


Example 8.12 Given sets A, B and C, prove A × (B ∩ C) = (A × B) ∩ (A × C).

Proof. First we will show that A × (B ∩ C) ⊆ (A × B) ∩ (A × C).
Suppose (a,b) ∈ A × (B ∩ C).
By definition of the Cartesian product, this means a ∈ A and b ∈ B ∩ C.
By definition of intersection, it follows that b ∈ B and b ∈ C.
Thus, since a ∈ A and b ∈ B, it follows that (a,b) ∈ A × B (by definition of ×).
Also, since a ∈ A and b ∈ C, it follows that (a,b) ∈ A × C (by definition of ×).
Now we have (a,b) ∈ A × B and (a,b) ∈ A × C, so (a,b) ∈ (A × B) ∩ (A × C).
We’ve shown that (a,b) ∈ A × (B ∩ C) implies (a,b) ∈ (A × B) ∩ (A × C) so we have
A × (B ∩ C) ⊆ (A × B) ∩ (A × C).

Next we will show that (A × B) ∩ (A × C) ⊆ A × (B ∩ C).
Suppose (a,b) ∈ (A × B) ∩ (A × C).
By definition of intersection, this means (a,b) ∈ A × B and (a,b) ∈ A × C.
By definition of the Cartesian product, (a,b) ∈ A × B means a ∈ A and b ∈ B.
By definition of the Cartesian product, (a,b) ∈ A × C means a ∈ A and b ∈ C.
We now have b ∈ B and b ∈ C, so b ∈ B ∩ C, by definition of intersection.
Thus we’ve deduced that a ∈ A and b ∈ B ∩ C, so (a,b) ∈ A × (B ∩ C).
In summary, we’ve shown that (a,b) ∈ (A × B) ∩ (A × C) implies (a,b) ∈ A × (B ∩ C)
so we have (A × B) ∩ (A × C) ⊆ A × (B ∩ C).

The previous two paragraphs show that A × (B ∩ C) ⊆ (A × B) ∩ (A × C) and
(A × B) ∩ (A × C) ⊆ A × (B ∩ C), so it follows that (A × B) ∩ (A × C) = A × (B ∩ C). ∎


Occasionally you can prove two sets are equal by working out a series 
of equalities leading from one set to the other. This is like showing two 
algebraic expressions are equal by manipulating one until you obtain the 
other. We illustrate this in the following example, which gives an alternate 
solution to the previous example. This approach is sometimes not applicable 
(or awkward), but when it works it can shorten a proof dramatically. 

A quick note before beginning the example. Notice that any statement P
is logically equivalent to P ∧ P. (Write out a truth table if you are in doubt.)
At one point in the following example we will replace the expression x ∈ A
with the logically equivalent statement (x ∈ A) ∧ (x ∈ A).


Example 8.13 Given sets A, B, and C, prove A × (B ∩ C) = (A × B) ∩ (A × C).

Proof. Just observe the following sequence of equalities.

A × (B ∩ C) = {(x,y) : (x ∈ A) ∧ (y ∈ B ∩ C)}                            (def. of ×)
            = {(x,y) : (x ∈ A) ∧ (y ∈ B) ∧ (y ∈ C)}                      (def. of ∩)
            = {(x,y) : (x ∈ A) ∧ (x ∈ A) ∧ (y ∈ B) ∧ (y ∈ C)}            (P = P ∧ P)
            = {(x,y) : ((x ∈ A) ∧ (y ∈ B)) ∧ ((x ∈ A) ∧ (y ∈ C))}        (rearrange)
            = {(x,y) : (x ∈ A) ∧ (y ∈ B)} ∩ {(x,y) : (x ∈ A) ∧ (y ∈ C)}  (def. of ∩)
            = (A × B) ∩ (A × C)                                          (def. of ×)

This completes the proof. ∎




The equation A × (B ∩ C) = (A × B) ∩ (A × C) just obtained is a fundamental
law that you may actually use fairly often as you continue with mathematics.
Some similar equations are listed below. Each of these can be proved with
this section’s techniques, and the exercises will ask that you do so.

DeMorgan’s laws for sets:
    \overline{A ∩ B} = \overline{A} ∪ \overline{B}
    \overline{A ∪ B} = \overline{A} ∩ \overline{B}

Distributive laws for sets:
    A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C)
    A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C)
    A × (B ∪ C) = (A × B) ∪ (A × C)
    A × (B ∩ C) = (A × B) ∩ (A × C)


It is very good practice to prove these equations. Depending on your 
learning style, it is probably not necessary to commit them to memory. But 
don’t forget them entirely. They may be useful later in your mathematical 
work. If so, you can look them up or re-derive them on the spot. If you go 
on to study mathematics deeply, you will at some point realize that you’ve 
internalized them without even being cognizant of it. 
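Before proving one of these laws, it can be reassuring to test it on a small example. The following Python sketch does this for all six equations; the universal set U and the sets A, B, C are arbitrary choices of ours, and the helper names complement and cartesian are hypothetical. A passing test on one example proves nothing in general, but a failing one would reveal a misremembered law.

```python
from itertools import product

U = set(range(10))  # assumption: a small universal set for the complements
A, B, C = {0, 1, 2, 3}, {2, 3, 4, 5}, {3, 5, 7, 9}

def complement(X):
    """Complement of X relative to the universal set U."""
    return U - X

def cartesian(X, Y):
    """Cartesian product X × Y as a set of ordered pairs."""
    return set(product(X, Y))

laws = {
    "DeMorgan 1": complement(A & B) == (complement(A) | complement(B)),
    "DeMorgan 2": complement(A | B) == (complement(A) & complement(B)),
    "intersection over union": (A & (B | C)) == ((A & B) | (A & C)),
    "union over intersection": (A | (B & C)) == ((A | B) & (A | C)),
    "product over union": cartesian(A, B | C) == (cartesian(A, B) | cartesian(A, C)),
    "product over intersection": cartesian(A, B & C) == (cartesian(A, B) & cartesian(A, C)),
}
all_hold = all(laws.values())
print(laws)
```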


8.4 Examples: Perfect Numbers 


Sometimes it takes a good bit of work and creativity to show that one set 
is a subset of another or that they are equal. We illustrate this now with 
examples from number theory involving what are called perfect numbers. 
Even though this topic is quite old, dating back more than 2000 years, it 
leads to some questions that are unanswered even today. 

The problem involves adding up the positive divisors of a natural number. 
To begin the discussion, consider the number 12. If we add up the positive 
divisors of 12 that are less than 12, we obtain 1+2+3+4+6=16, which is 
greater than 12. Doing the same thing for 15, we get 1+3+5=9, which is 
less than 15. For the most part, given a natural number p, the sum of its 
positive divisors less than itself will either be greater than p or less than p. 
But occasionally the divisors add up to exactly p. If this happens, then p is 
said to be a perfect number. 


Definition 8.1 A number p ∈ ℕ is perfect if it equals the sum of its
positive divisors less than itself. Some examples follow.

• The number 6 is perfect since 6 = 1 + 2 + 3.
• The number 28 is perfect since 28 = 1 + 2 + 4 + 7 + 14.
• The number 496 is perfect since 496 = 1 + 2 + 4 + 8 + 16 + 31 + 62 + 124 + 248.
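Definition 8.1 translates directly into a brute-force test. The Python sketch below is one possible rendering (the function names are our own); it recomputes the divisor sums for 12 and 15 discussed above and lists the perfect numbers up to 1000.

```python
def proper_divisor_sum(p):
    """Sum of the positive divisors of p that are less than p."""
    return sum(d for d in range(1, p) if p % d == 0)

def is_perfect(p):
    """True if the natural number p equals the sum of its smaller divisors."""
    return p >= 1 and proper_divisor_sum(p) == p

sum_for_12 = proper_divisor_sum(12)   # 1 + 2 + 3 + 4 + 6
sum_for_15 = proper_divisor_sum(15)   # 1 + 3 + 5
perfect_up_to_1000 = [p for p in range(1, 1001) if is_perfect(p)]
print(sum_for_12, sum_for_15, perfect_up_to_1000)
```

This approach tries every candidate divisor, so it is far too slow for large numbers. That is one reason a structural description of perfect numbers is valuable.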




Though it would take a while to find it by trial-and-error, the next perfect 
number after 496 is 8128. You can check that 8128 is perfect. Its divisors 
are 1, 2, 4, 8, 16, 32, 64, 127, 254, 508, 1016, 2032, 4064 and indeed 


8128 = 1 + 2 + 4 + 8 + 16 + 32 + 64 + 127 + 254 + 508 + 1016 + 2032 + 4064.


Are there other perfect numbers? How can they be found? Do they obey any 
patterns? These questions fascinated the ancient Greek mathematicians. 
In what follows we will develop an idea—recorded by Euclid—that partially 
answers these questions. Euclid lived millennia before set theory was even 
invented, so he certainly did not use sets. Nonetheless we will phrase his 
idea in the language of sets. 

Since our goal is to understand what numbers are perfect, let’s define 
the following set: 

P = {p ∈ ℕ : p is perfect}.

Therefore P = {6, 28, 496, 8128, ...}, but it is unclear what numbers are in P
other than the ones listed. Our goal is to gain a better understanding of 
just which numbers the set P includes. To do this, we will examine the 
following set A. It looks more complicated than P, but it will be very helpful 
for understanding P, as we will soon see. 


A = {2^{n−1}(2^n − 1) : n ∈ ℕ, and 2^n − 1 is prime}

In words, A consists of every natural number of form 2^{n−1}(2^n − 1), where
2^n − 1 is prime. To get a feel for what numbers belong to A, look at the
following table. For each natural number n, it tallies the corresponding
numbers 2^{n−1} and 2^n − 1. If 2^n − 1 happens to be prime, then the product
2^{n−1}(2^n − 1) is given; otherwise that entry is labeled with an *.


 n | 2^{n−1} | 2^n − 1 | 2^{n−1}(2^n − 1)
 1 |       1 |       1 | *
 2 |       2 |       3 | 6
 3 |       4 |       7 | 28
 4 |       8 |      15 | *
 5 |      16 |      31 | 496
 6 |      32 |      63 | *
 7 |      64 |     127 | 8128
 8 |     128 |     255 | *
 9 |     256 |     511 | *
10 |     512 |    1023 | *
11 |    1024 |    2047 | *
12 |    2048 |    4095 | *
13 |    4096 |    8191 | 33,550,336
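The table can be regenerated mechanically. The sketch below is an illustration only: is_prime is a naive trial-division test that we supply ourselves, and None plays the role of the * entries.

```python
def is_prime(m):
    """Naive trial division; adequate for the small values in the table."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

def table_rows(max_n):
    """Rows (n, 2^(n-1), 2^n - 1, element of A or None) for n = 1..max_n."""
    rows = []
    for n in range(1, max_n + 1):
        power = 2 ** (n - 1)
        candidate = 2 ** n - 1
        element = power * candidate if is_prime(candidate) else None
        rows.append((n, power, candidate, element))
    return rows

rows = table_rows(13)
elements_of_A = [row[3] for row in rows if row[3] is not None]
for row in rows:
    print(row)
```

Note that n = 11 gives no element of A even though 11 is prime, because 2047 = 23·89.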




Notice that the first four entries of A are the perfect numbers 6, 28, 496 
and 8128. At this point you may want to jump to the conclusion that A = P. 
But it is a shocking fact that in over 2000 years no one has ever been able 
to determine whether or not A = P. But it is known that A ⊆ P, and we will
now prove it. In other words, we are going to show that every element of A 
is perfect. (But by itself, that leaves open the possibility that there may be 
some perfect numbers in P that are not in A.) 

The main ingredient for the proof will be the formula for the sum of a 
geometric series with common ratio r. You probably saw this most recently 
in Calculus II. The formula is

    Σ_{k=0}^{n} r^k = (r^{n+1} − 1)/(r − 1).

We will need this for the case r = 2, which is

    Σ_{k=0}^{n} 2^k = 2^{n+1} − 1.    (8.1)


(See the solution for Exercise 19 in Section 7.4 for a proof of this formula.) 
Now we are ready to prove our result. Let’s draw attention to its significance 
by calling it a theorem rather than a proposition. 


Theorem 8.1 If A = {2^{n−1}(2^n − 1) : n ∈ ℕ, and 2^n − 1 is prime} and P =
{p ∈ ℕ : p is perfect}, then A ⊆ P.

Proof. Assume A and P are as stated. To show A ⊆ P, we must show that
p ∈ A implies p ∈ P. Thus suppose p ∈ A. By definition of A, this means

    p = 2^{n−1}(2^n − 1)    (8.2)

for some n ∈ ℕ for which 2^n − 1 is prime. We want to show that p ∈ P, that is,
we want to show p is perfect. Thus, we need to show that the
positive divisors of p that are less than p add up to p. Notice that since 2^n − 1
is prime, any positive divisor of p = 2^{n−1}(2^n − 1) must have the form 2^k or
2^k(2^n − 1) for 0 ≤ k ≤ n − 1. Thus the positive divisors of p are as follows:

    2^0, 2^1, 2^2, ..., 2^{n−2}, 2^{n−1},
    2^0(2^n − 1), 2^1(2^n − 1), 2^2(2^n − 1), ..., 2^{n−2}(2^n − 1), 2^{n−1}(2^n − 1).

Notice that this list starts with 2^0 = 1 and ends with 2^{n−1}(2^n − 1) = p.




If we add up all these divisors except for the last one (which equals p)
we get the following:

    Σ_{k=0}^{n−1} 2^k + Σ_{k=0}^{n−2} 2^k(2^n − 1)
        = Σ_{k=0}^{n−1} 2^k + (2^n − 1) Σ_{k=0}^{n−2} 2^k
        = (2^n − 1) + (2^n − 1)(2^{n−1} − 1)    (by Equation (8.1))
        = [1 + (2^{n−1} − 1)](2^n − 1)
        = 2^{n−1}(2^n − 1)
        = p    (by Equation (8.2)).

This shows that the positive divisors of p that are less than p add up to
p. Therefore p is perfect, by definition of a perfect number. Thus p ∈ P, by
definition of P.

We have shown that p ∈ A implies p ∈ P, which means A ⊆ P. ∎
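The key step of the proof is the claim about the form of the divisors of p. As a sanity check (not a proof), the Python sketch below compares the list of divisors from the proof with a brute-force list for the first few elements of A. The function names are our own. It also tries n = 4, where 2^n − 1 = 15 is not prime, to show that the claim really depends on primality.

```python
def divisors_from_proof(n):
    """The numbers 2^k and 2^k(2^n - 1) for 0 <= k <= n-1, as in the proof."""
    m = 2 ** n - 1
    return [2 ** k for k in range(n)] + [2 ** k * m for k in range(n)]

def brute_force_divisors(p):
    """All positive divisors of p, in increasing order."""
    return [d for d in range(1, p + 1) if p % d == 0]

checks = {}
for n in (2, 3, 5, 7):  # exponents for which 2^n - 1 is prime
    p = 2 ** (n - 1) * (2 ** n - 1)
    listed = divisors_from_proof(n)
    same_list = sorted(listed) == brute_force_divisors(p)
    sums_to_p = sum(listed) - p == p  # drop the last divisor, p itself
    checks[n] = (same_list, sums_to_p)

# For n = 4 the list from the proof misses divisors such as 3 and 5 of 120.
fails_for_composite = sorted(divisors_from_proof(4)) != brute_force_divisors(120)
print(checks, fails_for_composite)
```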


Combined with the table above, this theorem gives us
a new perfect number! The element p = 2^{13−1}(2^{13} − 1) = 33,550,336 in A is
perfect.

Observe also that every element of A is a multiple of a power of 2, and
therefore even. But this does not necessarily mean every perfect number
is even, because we’ve only shown A ⊆ P, not A = P. For all we know there
may be odd perfect numbers in P − A.

Are there any odd perfect numbers? No one knows. 

In over 2000 years, no one has ever found an odd perfect number, nor 
has anyone been able to prove that there are none. But it is known that the 
set A does contain every even perfect number. This fact was first proved by 
Euler, and we duplicate his reasoning in the next theorem, which proves 
that A = E, where E is the set of all even perfect numbers. It is a good 
example of how to prove two sets are equal. 

For convenience, we are going to use a slightly different definition of a
perfect number. A number p ∈ ℕ is perfect if its positive divisors add up
to 2p. For example, the number 6 is perfect since the sum of its divisors is
1 + 2 + 3 + 6 = 2·6. This definition is simpler than the first one because we do
not have to stipulate that we are adding up the divisors that are less than 
p. Instead we add in the last divisor p, and that has the effect of adding an 
additional p, thereby doubling the answer. 




Theorem 8.2. If A = {2^{n−1}(2^n − 1) : n ∈ ℕ, and 2^n − 1 is prime} and E =
{p ∈ ℕ : p is perfect and even}, then A = E.

Proof. To show that A = E, we need to show A ⊆ E and E ⊆ A.

First we will show that A ⊆ E. Suppose p ∈ A. This means p is even,
because the definition of A shows that every element of A is a multiple of
a power of 2. Also, p is a perfect number because Theorem 8.1 states that
every element of A is also an element of P, hence perfect. Thus p is an even
perfect number, so p ∈ E. Therefore A ⊆ E.

Next we show that E ⊆ A. Suppose p ∈ E. This means p is an even
perfect number. Write the prime factorization of p as p = 2^k 3^{n_1} 5^{n_2} 7^{n_3} ...,
where some of the powers n_1, n_2, n_3, ... may be zero. But, as p is even, the
power k must be greater than zero. It follows that p = 2^k q for some positive
integer k and an odd integer q. Now, our aim is to show that p ∈ A, which
means we must show p has form p = 2^{n−1}(2^n − 1). To get our current p = 2^k q
closer to this form, let n = k + 1, so we now have

    p = 2^{n−1} q.    (8.3)

List the positive divisors of q as d_1, d_2, d_3, ..., d_m. (Where d_1 = 1 and d_m = q.)
Then the divisors of p are:

    2^0 d_1        2^0 d_2        2^0 d_3        ...   2^0 d_m
    2^1 d_1        2^1 d_2        2^1 d_3        ...   2^1 d_m
    2^2 d_1        2^2 d_2        2^2 d_3        ...   2^2 d_m
    2^3 d_1        2^3 d_2        2^3 d_3        ...   2^3 d_m
      ⋮              ⋮              ⋮                    ⋮
    2^{n−1} d_1    2^{n−1} d_2    2^{n−1} d_3    ...   2^{n−1} d_m

Since p is perfect, these divisors add up to 2p. By Equation (8.3), their sum
is 2p = 2(2^{n−1} q) = 2^n q. Adding the divisors column-by-column, we get

    Σ_{k=0}^{n−1} 2^k d_1 + Σ_{k=0}^{n−1} 2^k d_2 + Σ_{k=0}^{n−1} 2^k d_3 + ··· + Σ_{k=0}^{n−1} 2^k d_m = 2^n q.

Applying Equation (8.1), this becomes

    (2^n − 1)d_1 + (2^n − 1)d_2 + (2^n − 1)d_3 + ··· + (2^n − 1)d_m = 2^n q
    (2^n − 1)(d_1 + d_2 + d_3 + ··· + d_m) = 2^n q
    d_1 + d_2 + d_3 + ··· + d_m = 2^n q / (2^n − 1),




so that

    d_1 + d_2 + d_3 + ··· + d_m = (2^n − 1 + 1)q / (2^n − 1)
                                = ((2^n − 1)q + q) / (2^n − 1)
                                = q + q/(2^n − 1).

From this we see that q/(2^n − 1) is an integer. It follows that both q and
q/(2^n − 1) are positive divisors of q. Since their sum equals the sum of all
positive divisors of q, it follows that q has only two positive divisors, q and
q/(2^n − 1). Since one of its divisors must be 1, it must be that q/(2^n − 1) = 1,
which means q = 2^n − 1.
Now a number with just two positive divisors is prime, so q = 2^n − 1 is prime.
Plugging this into Equation (8.3) gives p = 2^{n−1}(2^n − 1), where 2^n − 1 is prime.
This means p ∈ A, by definition of A. We have now shown that p ∈ E implies
p ∈ A, so E ⊆ A.

Since A ⊆ E and E ⊆ A, it follows that A = E. ∎
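Theorem 8.2 can be spot-checked on an initial segment of ℕ. The sketch below is an illustration under our own assumptions: the bound LIMIT and all names are arbitrary choices, and divisor sums are computed with a simple sieve. It builds the even perfect numbers up to the bound directly from the definition and compares them with the elements of A up to the same bound.

```python
LIMIT = 10_000  # assumption: an arbitrary bound for this experiment

# proper_sum[p] = sum of the positive divisors of p that are less than p
proper_sum = [0] * (LIMIT + 1)
for d in range(1, LIMIT // 2 + 1):
    for multiple in range(2 * d, LIMIT + 1, d):
        proper_sum[multiple] += d

# E restricted to the bound: even numbers equal to their proper divisor sum
E = {p for p in range(2, LIMIT + 1, 2) if proper_sum[p] == p}

def is_prime(m):
    """Naive trial division; fine for the small values needed here."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

# A restricted to the bound: 2^(n-1)(2^n - 1) with 2^n - 1 prime
A = set()
n = 1
while 2 ** (n - 1) * (2 ** n - 1) <= LIMIT:
    if is_prime(2 ** n - 1):
        A.add(2 ** (n - 1) * (2 ** n - 1))
    n += 1

print(sorted(A), sorted(E), A == E)
```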


Do not be alarmed if you feel that you wouldn’t have thought of this 
proof. It took the genius of Euler to discover this approach. 
We'll conclude this chapter with some facts about perfect numbers. 


• The sixth perfect number is p = 2^{17−1}(2^{17} − 1) = 8589869056.

• The seventh perfect number is p = 2^{19−1}(2^{19} − 1) = 137438691328.

• The eighth perfect number is p = 2^{31−1}(2^{31} − 1) = 2305843008139952128.

• The twentieth perfect number is p = 2^{4423−1}(2^{4423} − 1). It has 2663 digits.
• The twenty-third perfect number p = 2^{11,213−1}(2^{11,213} − 1) has 6957 digits.
• The fiftieth perfect number is p = 2^{77,232,917−1}(2^{77,232,917} − 1).


As mentioned earlier, no one knows whether or not there are any odd 
perfect numbers. It is not even known whether there are finitely many or 
infinitely many perfect numbers. It is known that the last digit of every 
even perfect number is either a 6 or an 8. Perhaps this is something you’d 
enjoy proving. 

We’ve seen that perfect numbers are closely related to prime numbers 
having the form 2^n − 1. Such prime numbers are called Mersenne primes,
after the French scholar Marin Mersenne (1588-1648), who popularized
them. The first several Mersenne primes are 2^2 − 1 = 3, 2^3 − 1 = 7, 2^5 − 1 = 31,
2^7 − 1 = 127 and 2^{13} − 1 = 8191. To date, only 51 Mersenne primes are known,
the largest of which is 2^{82,589,933} − 1. There is a substantial cash prize for
anyone who finds a 52nd. (See https://www.mersenne.org/.) You may have 
better luck with the exercises. 
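The small Mersenne primes are within easy reach of a computer. The following Python sketch is an illustration only: it uses naive trial division (serious searches rely on far more sophisticated tests), and the names are our own. It finds the exponents n < 20 for which 2^n − 1 is prime, forms the corresponding even perfect numbers, and records their last digits.

```python
def is_prime(m):
    """Trial division; adequate for the small values used here."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

mersenne_exponents = [n for n in range(1, 20) if is_prime(2 ** n - 1)]
mersenne_primes = [2 ** n - 1 for n in mersenne_exponents]
even_perfect = [2 ** (n - 1) * (2 ** n - 1) for n in mersenne_exponents]
last_digits = {p % 10 for p in even_perfect}
print(mersenne_exponents, even_perfect, last_digits)
```

The set last_digits agrees with the fact about final digits stated above, though of course only for these seven numbers.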




Exercises for Chapter 8 


Use the methods introduced in this chapter to prove the following statements. 


1. Prove that {12n : n ∈ ℤ} ⊆ {2n : n ∈ ℤ} ∩ {3n : n ∈ ℤ}.
2. Prove that {6n : n ∈ ℤ} = {2n : n ∈ ℤ} ∩ {3n : n ∈ ℤ}.
3. If k ∈ ℤ, then {n ∈ ℤ : n | k} ⊆ {n ∈ ℤ : n | k^2}.
4. If m, n ∈ ℤ, then {x ∈ ℤ : mn | x} ⊆ {x ∈ ℤ : m | x} ∩ {x ∈ ℤ : n | x}.
5. If p and q are positive integers, then {pn : n ∈ ℕ} ∩ {qn : n ∈ ℕ} ≠ ∅.
6. Suppose A, B and C are sets. Prove that if A ⊆ B, then A − C ⊆ B − C.
7. Suppose A, B and C are sets. If B ⊆ C, then A × B ⊆ A × C.
8. If A, B and C are sets, then A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C).
9. If A, B and C are sets, then A ∩ (B ∪ C) = (A ∩ B) ∪ (A ∩ C).
10. If A and B are sets in a universal set U, then \overline{A ∩ B} = \overline{A} ∪ \overline{B}.
11. If A and B are sets in a universal set U, then \overline{A ∪ B} = \overline{A} ∩ \overline{B}.
12. If A, B and C are sets, then A − (B ∩ C) = (A − B) ∪ (A − C).
13. If A, B and C are sets, then A − (B ∪ C) = (A − B) ∩ (A − C).
14. If A, B and C are sets, then (A ∪ B) − C = (A − C) ∪ (B − C).
15. If A, B and C are sets, then (A ∩ B) − C = (A − C) ∩ (B − C).
16. If A, B and C are sets, then A × (B ∪ C) = (A × B) ∪ (A × C).
17. If A, B and C are sets, then A × (B ∩ C) = (A × B) ∩ (A × C).
18. If A, B and C are sets, then A × (B − C) = (A × B) − (A × C).
19. Prove that {9^n : n ∈ ℤ} ⊆ {3^n : n ∈ ℤ}, but {9^n : n ∈ ℤ} ≠ {3^n : n ∈ ℤ}.
20. Prove that {9^n : n ∈ ℚ} = {3^n : n ∈ ℚ}.
21. Suppose A and B are sets. Prove A ⊆ B if and only if A − B = ∅.
22. Let A and B be sets. Prove that A ⊆ B if and only if A ∩ B = A.
23. For each a ∈ ℝ, let A_a = {(x, a(x^2 − 1)) ∈ ℝ^2 : x ∈ ℝ}. Prove that ⋂_{a∈ℝ} A_a = {(−1,0), (1,0)}.
24. Prove that ⋂_{x∈ℝ} [3 − x^2, 5 + x^2] = [3,5].
25. Suppose A, B, C and D are sets. Prove that (A × B) ∪ (C × D) ⊆ (A ∪ C) × (B ∪ D).
26. Prove that {4k + 5 : k ∈ ℤ} = {4k + 1 : k ∈ ℤ}.
27. Prove that {12a + 4b : a, b ∈ ℤ} = {4c : c ∈ ℤ}.
28. Prove that {12a + 25b : a, b ∈ ℤ} = ℤ.
29. Suppose A ≠ ∅. Prove that A × B ⊆ A × C if and only if B ⊆ C.
30. Prove that (ℤ × ℕ) ∩ (ℕ × ℤ) = ℕ × ℕ.
31. Suppose B ≠ ∅ and A × B ⊆ B × C. Prove that A ⊆ C.




CHAPTER 9 


Disproof 


Ever since Chapter 4 we have dealt with one major theme: Given a
statement, prove that it is true. In every example and exercise we
were handed a true statement and charged with the task of proving it. Have 
you ever wondered what would happen if you were given a false statement 
to prove? The answer is that no (correct) proof would be possible, for if it 
were, the statement would be true, not false. 

But how would you convince someone that a statement is false? The 
mere fact that you could not produce a proof does not automatically mean 
the statement is false, for you know (perhaps all too well) that proofs can 
be difficult to construct. It turns out that there is a very simple and utterly 
convincing procedure that proves a statement is false. The process of carry- 
ing out this procedure is called disproof. Thus, this chapter is concerned 
with disproving statements. 

Before describing the new method, we will set the stage with some 
relevant background information. First, we point out that mathematical 
statements can be divided into three categories, described below. 

One category consists of all those statements that have been proved to be 
true. For the most part we regard these statements as significant enough to 
be designated with special names such as “theorem,” “proposition,” “lemma” 
and “corollary.” Some examples of statements in this category are listed 
in the left-hand box in the diagram on the following page. There are also 
some wholly uninteresting statements (such as 2 = 2) in this category, and 
although we acknowledge their existence we certainly do not dignify them 
with terms such as “theorem” or “proposition.” 

At the other extreme is a category consisting of statements that are 
known to be false. Examples are listed in the box on the right. Since 
mathematicians are not very interested in them, these types of statements 
do not get any special names, other than the blanket term “false statement.” 

But there is a third (and quite interesting) category between these 
two extremes. It consists of statements whose truth or falsity has not 
been determined. Examples include things like “Every perfect number
is even,” or “Every even integer greater than 2 is the sum of two primes.”
(The latter statement is called the Goldbach conjecture. See Section 2.1.)
Mathematicians have a special name for the statements in this category
that they suspect (but haven’t yet proved) are true. Such statements are
called conjectures.


THREE TYPES OF STATEMENTS:

Known to be true              | Truth unknown                 | Known to be false
(Theorems & propositions)     | (Conjectures)                 |
------------------------------+-------------------------------+------------------------------
Examples:                     | Examples:                     | Examples:
• Pythagorean theorem         | • All perfect numbers are     | • All prime numbers are
• Fermat’s last theorem       |   even.                       |   odd.
  (Section 2.1)               | • Any even number greater     | • Some quadratic equations
• The square of an odd        |   than 2 is the sum of two    |   have three solutions.
  number is odd.              |   primes. (Goldbach’s         | • 0 = 1
• The series Σ_{k=1}^{∞} 1/k  |   conjecture, Section 2.1)    | • There exist natural
  diverges.                   | • There are infinitely many   |   numbers a, b and c
                              |   prime numbers of form       |   for which a^3 + b^3 = c^3.
                              |   2^n − 1, with n ∈ ℕ.        |


Mathematicians spend much of their time and energy attempting to 
prove or disprove conjectures. (They also expend considerable mental energy 
in creating new conjectures based on collected evidence or intuition.) When 
a conjecture is proved (or disproved) the proof or disproof will typically 
appear in a published paper, provided the conjecture is of sufficient interest. 
If it is proved, the conjecture attains the status of a theorem or proposition. 
If it is disproved, then no one is really very interested in it anymore— 
mathematicians do not care much for false statements. (Though some 
disproved conjectures are viewed as instructive examples or curiosities, 
especially if the conjecture had been considered significant.) 

Most conjectures that mathematicians are interested in are quite 
difficult to prove or disprove. We are not at that level yet. In this text, 
the “conjectures” that you will encounter are the kinds of statements that 
an experienced mathematician would immediately spot as true or false, but 
you may have to do some work before figuring out a proof or disproof. But 
in keeping with the cloud of uncertainty that surrounds conjectures at the 
advanced levels of mathematics, most exercises in this chapter (and many 
beyond it) will ask you to prove or disprove statements without giving any 
hint as to whether they are true or false. Your job will be to decide whether 
or not they are true and to either prove or disprove them. The examples
in this chapter will illustrate the processes one typically goes through in
deciding whether a statement is true or false, and then verifying that it’s 
true or false. 

You know the three major methods of proving a statement: direct proof, 
contrapositive proof and proof by contradiction. Now we are ready to under- 
stand the method of disproving a statement. Suppose you want to disprove 
a statement P. In other words you want to prove that P is false. The way to 
do this is to prove that ~ P is true, for if ~ P is true, it follows immediately 
that P has to be false. 


How to disprove P: Prove ~P. 


Our approach is incredibly simple. To disprove P, prove ~ P. In theory, 
this proof can be carried out by the direct, contrapositive or contradiction 
approaches. However, in practice things can be even easier than that if we 
are disproving a universally quantified statement or a conditional statement. 
That is our next topic. 


9.1 Disproving Universal Statements: Counterexamples 


A conjecture may be described as a statement that we hope is a theorem. 
As we know, many theorems (hence many conjectures) are universally quan- 
tified statements. Thus it seems reasonable to begin our discussion by 
investigating how to disprove a universally quantified statement such as 


∀x ∈ S, P(x).
To disprove this statement, we must prove its negation. Its negation is
~(∀x ∈ S, P(x)) = ∃x ∈ S, ~P(x).


The negation is an existence statement. To prove the negation is true, we 
just need to produce an example of an x ∈ S that makes ~P(x) true, that is,
an x that makes P(x) false. This leads to the following outline for disproving 
a universally quantified statement. 


How to disprove ∀x ∈ S, P(x).

Produce an example of an x ∈ S
that makes P(x) false.




Things are just as simple if we want to disprove a conditional statement 
P(x) ⇒ Q(x). This statement asserts that for every x that makes P(x) true,
Q(x) will also be true. The statement can only be false if there is an x that 
makes P(x) true and Q(x) false. This leads to our next outline for disproof. 


How to disprove P(x) ⇒ Q(x).


Produce an example of an x that 
makes P(x) true and Q(x) false. 


In both of the above outlines, the statement is disproved simply by 
exhibiting an example that shows the statement is not always true. (Think 
of it as an example that exposes the statement as a promise that can be 
broken.) There is a special name for an example that disproves a statement: 
It is called a counterexample. 


Example 9.1 As our first example, we will work through the process of 
deciding whether or not the following conjecture is true. 
Conjecture For every n ∈ ℤ, the integer f(n) = n^2 − n + 11 is prime.


In resolving the truth or falsity of a conjecture, it’s a good idea to gather 
as much information about the conjecture as possible. In this case let’s 
start by making a table that tallies the values of f(n) for some integers n. 


n    | −3  −2  −1   0   1   2   3   4   5   6   7   8   9   10
f(n) | 23  17  13  11  11  13  17  23  31  41  53  67  83  101


In every case, f(n) is prime, so you may begin to suspect that the conjecture
is true. Before attempting a proof, let’s try one more n. Unfortunately,
f(11) = 11^2 − 11 + 11 = 11^2 is not prime. The conjecture is false because n = 11
is a counterexample. We summarize our disproof as follows:

Disproof. The statement “For every n ∈ ℤ, the integer f(n) = n^2 − n + 11 is
prime,” is false. For a counterexample, note that for n = 11, the integer
f(11) = 121 = 11·11 is not prime. ∎


In disproving a statement with a counterexample, it is important to explain 
exactly how the counterexample makes the statement false. Our work would 
not have been complete if we had just said “for a counterexample, consider 
n = 11,” and left it at that. We need to show that the answer f(11) is not
prime. Showing the factorization f(11) = 11·11 suffices for this.
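Hunting for a counterexample is the kind of task a computer does well. The sketch below is an illustration only (the helper names and the search range are our own choices); it extends the table of values of f(n) and stops at the first n for which f(n) is not prime.

```python
def f(n):
    return n * n - n + 11

def is_prime(m):
    """Naive trial division; adequate for the small values of f(n) here."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

def first_counterexample(start, stop):
    """Return the first n in range(start, stop) with f(n) not prime, or None."""
    for n in range(start, stop):
        if not is_prime(f(n)):
            return n
    return None

n = first_counterexample(-3, 100)
print(n, f(n))
```

The search only locates a candidate. The disproof itself is the explicit factorization f(11) = 11·11.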




Example 9.2 Either prove or disprove the following conjecture. 


Conjecture If A, B and C are sets, then A − (B ∩ C) = (A − B) ∩ (A − C).

Disproof. This conjecture is false because of the following counterexample.
Let A = {1,2,3}, B = {1,2} and C = {2,3}. Notice that A − (B ∩ C) = {1,3} and
(A − B) ∩ (A − C) = ∅, so A − (B ∩ C) ≠ (A − B) ∩ (A − C). ∎

(To see where this counterexample came from, draw Venn diagrams for
A − (B ∩ C) and (A − B) ∩ (A − C). You will see that the diagrams are different.
The numbers 1, 2 and 3 can then be inserted into the regions of the diagrams 
in such a way as to create the above counterexample.) 
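The counterexample is easy to verify by machine. The Python sketch below (an illustration, using Python’s built-in set operations) computes both sides of the conjectured equation for the sets above. For contrast it also evaluates (A − B) ∪ (A − C), the right-hand side of the true identity in Exercise 12 of Chapter 8.

```python
A, B, C = {1, 2, 3}, {1, 2}, {2, 3}

left = A - (B & C)                  # A − (B ∩ C)
right = (A - B) & (A - C)           # (A − B) ∩ (A − C), the conjectured form
right_with_union = (A - B) | (A - C)  # (A − B) ∪ (A − C), the true identity

print(left, right, right_with_union)
```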


9.2 Disproving Existence Statements 


We have seen that we can disprove a universally quantified statement or a 
conditional statement simply by finding a counterexample. Now let’s turn 
to the problem of disproving an existence statement such as 


∃x ∈ S, P(x).

Proving this would involve simply finding an example of an x that makes
P(x) true. To disprove it, we have to prove its negation ~(∃x ∈ S, P(x)) =
∀x ∈ S, ~P(x). But this negation is universally quantified. Proving it involves
showing that ~P(x) is true for all x ∈ S, and for this an example does not
suffice. Instead we must use direct, contrapositive or contradiction proof to
prove the conditional statement “If x ∈ S, then ~P(x).” As an example, here
is a conjecture to either prove or disprove. 


Example 9.3 Either prove or disprove the following conjecture. 


Conjecture There is a real number x for which x^4 < x < x^2.

This may not seem like an unreasonable statement at first glance. After all,
if the statement were asserting the existence of a real number for which
x^3 < x < x^2, then it would be true: just take x = −2. But it asserts there
is an x for which x^4 < x < x^2. When we apply some intelligent guessing to
locate such an x we run into trouble. If x = 1/2, then x^4 < x, but we don’t have
x < x^2. Similarly if x = 2, we have x < x^2 but not x^4 < x. Since finding an x
with x^4 < x < x^2 seems problematic, we may begin to suspect that the given
statement is false.

Let’s see if we can disprove it. According to our strategy for disproof,
to disprove it we must prove its negation. Symbolically, the statement is
∃x ∈ ℝ, x^4 < x < x^2, so its negation is

~(∃x ∈ ℝ, x^4 < x < x^2) = ∀x ∈ ℝ, ~(x^4 < x < x^2).

Thus, in words the negation is

For every real number x, it is not the case that x^4 < x < x^2.

This can be proved with contradiction, as follows. Suppose for the sake of
contradiction that there is an x for which x^4 < x < x^2. Then x must be positive
since it’s greater than the non-negative number x^4. Dividing all parts of
x^4 < x < x^2 by the positive number x produces x^3 < 1 < x. Now subtract 1 from
all parts of x^3 < 1 < x to obtain x^3 − 1 < 0 < x − 1 and reason as follows:

    x^3 − 1 < 0 < x − 1
    (x − 1)(x^2 + x + 1) < 0 < (x − 1)
    x^2 + x + 1 < 0 < 1

(Division by x − 1 did not reverse the inequality < because the second line
above shows 0 < x − 1, that is, x − 1 is positive.) Now we have x^2 + x + 1 < 0,
which is a contradiction because x being positive forces x^2 + x + 1 > 0.

We summarize our work as follows.

The statement “There is a real number x for which x^4 < x < x^2” is false
because we have proved its negation “For every real number x, it is not the
case that x^4 < x < x^2.”
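The “intelligent guessing” stage can be automated. The sketch below is an illustration only; the sample grid is an arbitrary choice of ours. It tests many values of x and finds none satisfying x^4 < x < x^2, while the similar-looking inequality with x^3 is satisfied by x = −2. Such a search can only raise the suspicion that the statement is false. It is the proof of the negation above that settles the matter.

```python
def satisfies(x):
    """True if x^4 < x < x^2."""
    return x ** 4 < x < x ** 2

def satisfies_cubic(x):
    """True if x^3 < x < x^2 (the variant that does have solutions)."""
    return x ** 3 < x < x ** 2

# assumption: sample points -5.000, -4.999, ..., 5.000
samples = [k / 1000 for k in range(-5000, 5001)]
witnesses = [x for x in samples if satisfies(x)]

print(len(witnesses), satisfies_cubic(-2))
```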


As you work the exercises, keep in mind that not every conjecture will be 
false. If one is true, then a disproof is impossible and you must produce a 
proof. Here is an example: 


Example 9.4 Either prove or disprove the following conjecture. 


Conjecture There exist three integers x, y, z, all greater than 1 and no
two equal, for which $x^y = y^z$.


This conjecture is true. It is an existence statement, so to prove it we
just need to give an example of three integers x, y, z, all greater than 1 and
no two equal, so that $x^y = y^z$. A proof follows.


Proposition There exist three integers x, y, z, all greater than 1 and no
two equal, for which $x^y = y^z$.


Proof. Note that if x = 2, y = 16 and z = 4, then $x^y = 2^{16} = (2^4)^4 = 16^4 = y^z$. ∎
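An example like the one in this proof can also be found by exhaustive search. The following Python sketch is only an illustration: the function name `find_triples` and the search bound are our own choices, not part of the text. It lists the triples of distinct integers greater than 1, up to a bound, for which $x^y = y^z$.

```python
def find_triples(bound):
    """Return all (x, y, z) with 1 < x, y, z <= bound, no two equal, and x**y == y**z."""
    triples = []
    for x in range(2, bound + 1):
        for y in range(2, bound + 1):
            for z in range(2, bound + 1):
                if len({x, y, z}) == 3 and x**y == y**z:
                    triples.append((x, y, z))
    return triples


# The triple used in the proof shows up in the search.
assert (2, 16, 4) in find_triples(16)
```

A single triple is all an existence proof needs; the search merely suggests where to look.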




9.3 Disproof by Contradiction 


Contradiction can be a very useful way to disprove a statement. To see how 
this works, suppose we wish to disprove a statement P. We know that to 
disprove P, we must prove ~ P. To prove ~ P with contradiction, we assume 
~~ P is true and deduce a contradiction. But since ~~ P = P, this boils down 
to assuming P is true and deducing a contradiction. Here is an outline: 


How to disprove P with contradiction: 


Assume P is true, and deduce a contradiction. 


To illustrate this, let’s revisit Example 9.3 but do the disproof with 
contradiction. You will notice that the work duplicates much of what we 
did in Example 9.3, but it is much more streamlined because here we do
not have to negate the conjecture. 


Example 9.5 Disprove the following conjecture. 


Conjecture There is a real number x for which $x^4 < x < x^2$.


Disproof. Suppose for the sake of contradiction that this conjecture is true.
Let x be a real number for which $x^4 < x < x^2$. Then x is positive, since it is
greater than the non-negative number $x^4$. Dividing all parts of $x^4 < x < x^2$
by the positive number x produces $x^3 < 1 < x$. Now subtract 1 from all parts
of $x^3 < 1 < x$ to obtain $x^3 - 1 < 0 < x - 1$ and reason as follows:

\begin{align*}
x^3 - 1 &< 0 < x - 1 \\
(x-1)(x^2+x+1) &< 0 < (x-1) \\
x^2+x+1 &< 0 < 1
\end{align*}

Now we have $x^2 + x + 1 < 0$, which is a contradiction because x is positive.
Thus the conjecture must be false. ∎


Exercises for Chapter 9 


Each of the following statements is either true or false. If a statement is true, prove 
it. If a statement is false, disprove it. These exercises are cumulative, covering all 
topics addressed in Chapters 1-9. 


1. If $x, y \in \mathbb{R}$, then $|x + y| = |x| + |y|$.
2. For every natural number n, the integer $2n^2 - 4n + 31$ is prime.
3. If $n \in \mathbb{Z}$ and $n^5 - n$ is even, then n is even.
4. For every natural number n, the integer $n^2 + 17n + 17$ is prime.
5. If A, B, C and D are sets, then $(A \times B) \cup (C \times D) = (A \cup C) \times (B \cup D)$.
6. If A, B, C and D are sets, then $(A \times B) \cap (C \times D) = (A \cap C) \times (B \cap D)$.
7. If A, B and C are sets, and $A \times C = B \times C$, then $A = B$.
8. If A, B and C are sets, then $A - (B \cup C) = (A - B) \cup (A - C)$.
9. If A and B are sets, then $\mathscr{P}(A) - \mathscr{P}(B) \subseteq \mathscr{P}(A - B)$.
10. If A and B are sets and $A \cap B = \emptyset$, then $\mathscr{P}(A) - \mathscr{P}(B) \subseteq \mathscr{P}(A - B)$.
11. If $a, b \in \mathbb{N}$, then $a + b < ab$.
12. If $a, b, c \in \mathbb{N}$ and ab, bc and ac all have the same parity, then a, b and c all have
the same parity.
13. There exists a set X for which $\mathbb{R} \subseteq X$ and $\emptyset \in X$.
14. If A and B are sets, then $\mathscr{P}(A) \cap \mathscr{P}(B) = \mathscr{P}(A \cap B)$.
15. Every odd integer is the sum of three odd integers.
16. If A and B are finite sets, then $|A \cup B| = |A| + |B|$.
17. For all sets A and B, if $A - B = \emptyset$, then $B \neq \emptyset$.
18. If $a, b, c \in \mathbb{N}$, then at least one of $a - b$, $a + c$ and $b - c$ is even.
19. For every $r, s \in \mathbb{Q}$ with $r < s$, there is an irrational number u for which $r < u < s$.
20. There exist prime numbers p and q for which $p - q = 1000$.
21. There exist prime numbers p and q for which $p - q = 97$.
22. If p and q are prime numbers for which $p < q$, then $2p + q^2$ is odd.
23. If $x, y \in \mathbb{R}$ and $x^3 < y^3$, then $x < y$.
24. The inequality $2^x \ge x + 1$ is true for all positive real numbers x.
25. For all $a, b, c \in \mathbb{Z}$, if $a \mid bc$, then $a \mid b$ or $a \mid c$.
26. Suppose A, B and C are sets. If $A = B - C$, then $B = A \cup C$.
27. The equation $x^2 = 2^x$ has three real solutions.
28. Suppose $a, b \in \mathbb{Z}$. If $a \mid b$ and $b \mid a$, then $a = b$.
29. If $x, y \in \mathbb{R}$ and $|x + y| = |x - y|$, then $y = 0$.
30. There exist integers a and b for which $42a + 7b = 1$.
31. No number (other than 1) appears in Pascal’s triangle more than four times.
32. If $n, k \in \mathbb{N}$ and $\binom{n}{k}$ is a prime number, then $k = 1$ or $k = n - 1$.
33. Suppose $f(x) = a_0 + a_1 x + a_2 x^2 + \cdots + a_n x^n$ is a polynomial of degree 1 or greater,
and for which each coefficient $a_i$ is in $\mathbb{N}$. Then there is a $k \in \mathbb{N}$ for which the
integer $f(k)$ is not prime.
34. If $X \subseteq A \cup B$, then $X \subseteq A$ or $X \subseteq B$.
35. In Chapter 5, Exercise 25 asked you to prove that if $2^n - 1$ is prime, then n is
prime. Is the converse true?




CHAPTER 10 


Mathematical Induction 


This chapter explains a powerful proof technique called mathematical
induction (or just induction for short). To motivate the discussion,
let’s first examine the kinds of statements that induction is used to prove.
Consider this statement:


Conjecture The sum of the first n odd natural numbers equals $n^2$.


The following table illustrates what this conjecture says. Each row is
headed by a natural number n, followed by the sum of the first n odd natural
numbers, followed by $n^2$.


n | sum of the first n odd natural numbers | $n^2$
1 | 1 = | 1
2 | 1 + 3 = | 4
3 | 1 + 3 + 5 = | 9
4 | 1 + 3 + 5 + 7 = | 16
5 | 1 + 3 + 5 + 7 + 9 = | 25
n | 1 + 3 + 5 + 7 + 9 + 11 + ··· + (2n − 1) = | $n^2$


Note that in the first five lines of the table, the sum of the first n odd
numbers really does add up to $n^2$. Notice also that these first five lines
indicate that the nth odd natural number (the last number in each sum)
is $2n - 1$. (For instance, when n = 2, the second odd natural number is
$2 \cdot 2 - 1 = 3$; when n = 3, the third odd natural number is $2 \cdot 3 - 1 = 5$, etc.)

The table raises a question. Does the sum $1 + 3 + 5 + 7 + \cdots + (2n - 1)$ really
always equal $n^2$? In other words, is the conjecture true?

Let’s rephrase this. For each natural number n (i.e., for each line of the
table), we have a statement $S_n$, as follows:

$S_1: 1 = 1^2$
$S_2: 1 + 3 = 2^2$
$S_3: 1 + 3 + 5 = 3^2$
$\vdots$
$S_n: 1 + 3 + 5 + 7 + \cdots + (2n - 1) = n^2$


Our question is: Are all of these statements true?

Mathematical induction answers just this kind of question, where we
have an infinite list of statements $S_1, S_2, S_3, \ldots$ that we want to prove true.
The method is really quite simple. To visualize it, think of the statements
as dominoes, lined up in a row. Suppose you can prove the first statement
$S_1$, and symbolize this as domino $S_1$ being knocked down. Also, say you can
prove that any statement $S_k$ being true (falling) forces the next statement
$S_{k+1}$ to be true (to fall). Then $S_1$ falls, knocking down $S_2$. Next $S_2$ falls,
knocking down $S_3$, then $S_3$ knocks down $S_4$, and so on. The inescapable
conclusion is that all the statements are knocked down (proved true).


[Figure: The Simple Idea Behind Mathematical Induction. Final caption: “Then all must fall (all are proved true).”]




10.1 Proof by Induction 


This domino analogy motivates an outline for our next major proof technique: 
proof by mathematical induction. 


Outline for Proof by Induction 


Proposition The statements $S_1, S_2, S_3, S_4, \ldots$ are all true.


Proof. (Induction)

(1) Prove that the first statement $S_1$ is true.

(2) Given any integer $k \ge 1$, prove that the statement $S_k \Rightarrow S_{k+1}$ is true.
It follows by mathematical induction that every $S_n$ is true. ∎


In this setup, the first step (1) is called the basis step. Because $S_1$ is
usually a very simple statement, the basis step is often quite easy to do. The
second step (2) is called the inductive step. In the inductive step direct
proof is most often used to prove $S_k \Rightarrow S_{k+1}$, so this step is usually carried
out by assuming $S_k$ is true and showing this forces $S_{k+1}$ to be true. The
assumption that $S_k$ is true is called the inductive hypothesis.

Now let’s apply this technique to our original conjecture that the sum of
the first n odd natural numbers equals $n^2$. Our goal is to show that for each
$n \in \mathbb{N}$, the statement $S_n: 1+3+5+7+\cdots+(2n-1) = n^2$ is true. Before getting
started, observe that $S_k$ is obtained from $S_n$ by plugging k in for n. Thus
$S_k$ is the statement $S_k: 1+3+5+7+\cdots+(2k-1) = k^2$. Also, we get $S_{k+1}$ by
plugging in k+1 for n, so that $S_{k+1}: 1+3+5+7+\cdots+(2(k+1)-1) = (k+1)^2$.

Proposition If $n \in \mathbb{N}$, then $1+3+5+7+\cdots+(2n-1) = n^2$.

Proof. We will prove this with mathematical induction.

(1) Observe that if n = 1, this statement is $1 = 1^2$, which is obviously true.

(2) We must now prove $S_k \Rightarrow S_{k+1}$ for any $k \ge 1$. That is, we must show that
if $1+3+5+7+\cdots+(2k-1) = k^2$, then $1+3+5+7+\cdots+(2(k+1)-1) = (k+1)^2$.
We use direct proof. Suppose $1+3+5+7+\cdots+(2k-1) = k^2$. Then

\begin{align*}
1+3+5+7+\cdots+(2(k+1)-1) &= 1+3+5+7+\cdots+(2k-1)+(2(k+1)-1) \\
&= \bigl(1+3+5+7+\cdots+(2k-1)\bigr)+(2(k+1)-1) \\
&= k^2+(2(k+1)-1) = k^2+2k+1 \\
&= (k+1)^2.
\end{align*}

Thus $1+3+5+7+\cdots+(2(k+1)-1) = (k+1)^2$. This proves that $S_k \Rightarrow S_{k+1}$.
It follows by induction that $1+3+5+7+\cdots+(2n-1) = n^2$ for every $n \in \mathbb{N}$. ∎
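The two steps of this proof can be spot-checked numerically. The Python sketch below is an illustration only (the helper name `sum_first_odds` is our own), and checking finitely many cases is of course no substitute for the proof: it merely confirms the basis step and the algebra of the inductive step for small values of k.

```python
def sum_first_odds(n):
    """Sum of the first n odd natural numbers, 1 + 3 + ... + (2n - 1)."""
    return sum(2 * i - 1 for i in range(1, n + 1))


# Basis step: S_1 says 1 = 1^2.
assert sum_first_odds(1) == 1**2

# Inductive step, checked for k = 1, ..., 999: adding the (k+1)th odd
# number 2(k+1) - 1 to k^2 gives (k+1)^2.
for k in range(1, 1000):
    assert k**2 + (2 * (k + 1) - 1) == (k + 1) ** 2
    assert sum_first_odds(k + 1) == (k + 1) ** 2
```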




In induction proofs it is usually the case that the first statement $S_1$ is
indexed by the natural number 1, but this need not always be so. Depending
on the problem, the first statement could be $S_0$, or $S_m$ for any other integer m.
In the next example $S_0, S_1, S_2, S_3, \ldots$ are the statements. The same outline
is used, except that the basis step verifies $S_0$, not $S_1$.


Proposition If n is a non-negative integer, then $5 \mid (n^5 - n)$.


Proof. We will prove this with mathematical induction. Observe that the
first non-negative integer is 0, so the basis step involves n = 0.

(1) If n = 0, this statement is $5 \mid (0^5 - 0)$ or $5 \mid 0$, which is obviously true.

(2) Let $k \ge 0$. We need to prove that if $5 \mid (k^5 - k)$, then $5 \mid ((k+1)^5 - (k+1))$.
We use direct proof. Suppose $5 \mid (k^5 - k)$. Thus $k^5 - k = 5a$ for some $a \in \mathbb{Z}$.
Observe that

\begin{align*}
(k+1)^5 - (k+1) &= k^5 + 5k^4 + 10k^3 + 10k^2 + 5k + 1 - k - 1 \\
&= (k^5 - k) + 5k^4 + 10k^3 + 10k^2 + 5k \\
&= 5a + 5k^4 + 10k^3 + 10k^2 + 5k \\
&= 5(a + k^4 + 2k^3 + 2k^2 + k).
\end{align*}

This shows $(k+1)^5 - (k+1)$ is an integer multiple of 5, so $5 \mid ((k+1)^5 - (k+1))$.
We have now shown that $5 \mid (k^5 - k)$ implies $5 \mid ((k+1)^5 - (k+1))$.

It follows by induction that $5 \mid (n^5 - n)$ for all non-negative integers n. ∎


As noted, induction is used to prove statements of the form $\forall n \in \mathbb{N}, S_n$.
But notice the outline does not work for statements of form $\forall n \in \mathbb{Z}, S_n$ (where
n is in $\mathbb{Z}$, not $\mathbb{N}$). The reason is that if you are trying to prove $\forall n \in \mathbb{Z}, S_n$ by
induction, and you’ve shown $S_1$ is true and $S_k \Rightarrow S_{k+1}$, then it only follows
from this that $S_n$ is true for $n \ge 1$. You haven’t proved that any of the
statements $S_0, S_{-1}, S_{-2}, \ldots$ are true. If you ever want to prove $\forall n \in \mathbb{Z}, S_n$
by induction, you have to show that some $S_a$ is true and $S_k \Rightarrow S_{k+1}$ and
$S_k \Rightarrow S_{k-1}$.

Unfortunately, the term mathematical induction is sometimes confused 
with inductive reasoning, which is the process of reaching the conclusion 
that something is likely to be true based on prior observations of similar 
circumstances. Please note that mathematical induction—as introduced in 
this chapter—is a rigorous technique that proves statements with absolute 
certainty. 

To round out this section, we present four additional induction proofs. 




Proposition If $n \in \mathbb{Z}$ and $n \ge 0$, then $\sum_{i=0}^{n} i \cdot i! = (n+1)! - 1$.


Proof. We will prove this with mathematical induction.

(1) If n = 0, this statement is $\sum_{i=0}^{0} i \cdot i! = (0+1)! - 1$. The left-hand side is
$0 \cdot 0! = 0$, and the right-hand side is $1! - 1 = 0$. Thus the equation holds,
as both sides are zero.

(2) Consider any integer $k \ge 0$. We must show that $S_k$ implies $S_{k+1}$. That
is, we must show that

$$\sum_{i=0}^{k} i \cdot i! = (k+1)! - 1 \quad \text{implies} \quad \sum_{i=0}^{k+1} i \cdot i! = ((k+1)+1)! - 1.$$

We use direct proof. Suppose $\sum_{i=0}^{k} i \cdot i! = (k+1)! - 1$. Observe that

\begin{align*}
\sum_{i=0}^{k+1} i \cdot i! &= \Bigl(\sum_{i=0}^{k} i \cdot i!\Bigr) + (k+1)(k+1)! \\
&= \bigl((k+1)! - 1\bigr) + (k+1)(k+1)! \\
&= (k+1)! + (k+1)(k+1)! - 1 \\
&= \bigl(1 + (k+1)\bigr)(k+1)! - 1 \\
&= (k+2)(k+1)! - 1 \\
&= (k+2)! - 1 \\
&= ((k+1)+1)! - 1.
\end{align*}

Therefore $\sum_{i=0}^{k+1} i \cdot i! = ((k+1)+1)! - 1$.

We have now proved by induction that $\sum_{i=0}^{n} i \cdot i! = (n+1)! - 1$ for every integer
$n \ge 0$. ∎


In our outline for proof by induction, the inductive step (2) involved
proving $S_k \Rightarrow S_{k+1}$. Obviously, you can prove $S_n \Rightarrow S_{n+1}$ instead. (That
is, assume the statement is true for n, and show that it is true for n + 1.)
Sometimes proving $S_{n-1} \Rightarrow S_n$ is more convenient, and this too is valid.
The proofs in the following examples will use the scheme $S_k \Rightarrow S_{k+1}$, but
some solutions to odd-numbered exercises will be phrased as $S_n \Rightarrow S_{n+1}$ or
$S_{n-1} \Rightarrow S_n$. Fluency comes with reading and practice.




The next example illustrates a trick that is occasionally useful. You 
know that you can add equal quantities to both sides of an equation without 
violating equality. But don’t forget that you can add unequal quantities to 
both sides of an inequality, as long as the quantity added to the bigger side 
is bigger than the quantity added to the smaller side. For example, if $x \le y$
and $a \le b$, then $x + a \le y + b$. Similarly, if $x \le y$ and b is positive, then $x \le y + b$.
This oft-neglected fact is used in the next proof. 


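Proposition The inequality $2^n \le 2^{n+1} - 2^{n-1} - 1$ holds for each $n \in \mathbb{N}$.


Proof. We will prove this with mathematical induction.
(1) If n = 1, this statement is $2^1 \le 2^{1+1} - 2^{1-1} - 1$, and this simplifies to
$2 \le 4 - 1 - 1$, which is obviously true.

(2) Say $k \ge 1$. We use direct proof to show that $2^k \le 2^{k+1} - 2^{k-1} - 1$ implies
$2^{k+1} \le 2^{(k+1)+1} - 2^{(k+1)-1} - 1$. Suppose $2^k \le 2^{k+1} - 2^{k-1} - 1$. Then

\begin{align*}
2(2^k) &\le 2(2^{k+1} - 2^{k-1} - 1) && \text{(multiply both sides by 2)} \\
2^{k+1} &\le 2^{k+2} - 2^k - 2 \\
2^{k+1} &\le 2^{k+2} - 2^k - 2 + 1 && \text{(add 1 to the bigger side)} \\
2^{k+1} &\le 2^{k+2} - 2^k - 1 \\
2^{k+1} &\le 2^{(k+1)+1} - 2^{(k+1)-1} - 1
\end{align*}

It follows by induction that $2^n \le 2^{n+1} - 2^{n-1} - 1$ for each $n \in \mathbb{N}$. ∎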


We next prove that if $n \in \mathbb{N}$, then the inequality $(1+x)^n \ge 1+nx$ holds for
all $x \in \mathbb{R}$ with $x > -1$. Thus we will need to prove that the statement

$$S_n: (1+x)^n \ge 1+nx \ \text{ for every } x \in \mathbb{R} \text{ with } x > -1$$

is true for every natural number n. This is (only) slightly different from our
other examples, which proved statements of the form $\forall n \in \mathbb{N}, P(n)$, where
P(n) is a statement about the number n. This time we are proving something
of the form

$$\forall n \in \mathbb{N},\ \bigl(\forall x \in (-1, \infty),\ P(n, x)\bigr),$$

where the open sentence $P(n,x): (1+x)^n \ge 1+nx$ involves not only n, but also
a second variable x. (For the record, the inequality $(1+x)^n \ge 1+nx$ is known
as Bernoulli’s inequality.)




Proposition If $n \in \mathbb{N}$, then $(1+x)^n \ge 1+nx$ for all $x \in \mathbb{R}$ with $x > -1$.


Proof. We will prove this with mathematical induction.

(1) For the basis step, notice that when n = 1 the statement is $(1+x)^1 \ge 1 + 1 \cdot x$,
and this is true because both sides equal $1 + x$.

(2) Assume that for some $k \ge 1$, the statement $(1+x)^k \ge 1+kx$ is true for
all $x \in \mathbb{R}$ with $x > -1$. From this we need to prove $(1+x)^{k+1} \ge 1+(k+1)x$.
Now, $1 + x$ is positive because $x > -1$, so we can multiply both sides of
$(1+x)^k \ge 1+kx$ by $(1+x)$ without changing the direction of the $\ge$.

\begin{align*}
(1+x)^k(1+x) &\ge (1+kx)(1+x) \\
(1+x)^{k+1} &\ge 1 + x + kx + kx^2 \\
(1+x)^{k+1} &\ge 1 + (k+1)x + kx^2
\end{align*}

The above term $kx^2$ is non-negative, so removing it from the right-hand side
cannot make that side larger. Thus we get $(1+x)^{k+1} \ge 1+(k+1)x$. ∎
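Bernoulli’s inequality is easy to test on sample values. The following Python sketch is an illustration under our own assumptions: the function name `bernoulli_holds` and the sample grid of rational x values are hypothetical choices, and exact rational arithmetic is used only to avoid floating-point rounding. It also shows that the hypothesis $x > -1$ cannot simply be dropped.

```python
from fractions import Fraction


def bernoulli_holds(n, x):
    """Check (1 + x)^n >= 1 + n*x exactly, for a natural number n and rational x."""
    return (1 + x) ** n >= 1 + n * x


# Sample rational values of x in (-1, infinity), including x = 0.
xs = [Fraction(p, 10) for p in range(-9, 31)]
assert all(bernoulli_holds(n, x) for n in range(1, 30) for x in xs)

# Without x > -1 the inequality can fail: for n = 3 and x = -4,
# (1 + x)^3 = -27 while 1 + 3x = -11.
assert not bernoulli_holds(3, Fraction(-4))
```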


Next, an example where the basis step involves more than routine checking.
(It will be used later, so it is numbered for reference.)


Proposition 10.1 Suppose $a_1, a_2, \ldots, a_n$ are n integers, where $n \ge 2$. If p
is prime and $p \mid (a_1 \cdot a_2 \cdot a_3 \cdots a_n)$, then $p \mid a_i$ for at least one of the $a_i$.


Proof. The proof is induction on n.

(1) The basis step involves n = 2. Let p be prime and suppose $p \mid (a_1 a_2)$. We
need to show that $p \mid a_1$ or $p \mid a_2$, or equivalently, if $p \nmid a_1$, then $p \mid a_2$.
Thus suppose $p \nmid a_1$. Since p is prime, it follows that $\gcd(p, a_1) = 1$. By
Proposition 7.1 (on page 152), there are integers k and $\ell$ for which
$1 = pk + a_1 \ell$. Multiplying this by $a_2$ gives

$$a_2 = pk a_2 + a_1 a_2 \ell.$$

As we are assuming that p divides $a_1 a_2$, it is clear that p divides the
expression $pk a_2 + a_1 a_2 \ell$ on the right; hence $p \mid a_2$. We’ve now proved
that if $p \mid (a_1 a_2)$, then $p \mid a_1$ or $p \mid a_2$. This completes the basis step.

(2) Suppose that $k \ge 2$, and $p \mid (a_1 \cdot a_2 \cdots a_k)$ implies $p \mid a_i$ for some $a_i$.
Now let $p \mid (a_1 \cdot a_2 \cdots a_k \cdot a_{k+1})$. Then $p \mid ((a_1 \cdot a_2 \cdots a_k) \cdot a_{k+1})$. By what we
proved in the basis step, it follows that $p \mid (a_1 \cdot a_2 \cdots a_k)$ or $p \mid a_{k+1}$. This
and the inductive hypothesis imply that p divides one of the $a_i$. ∎
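To see the basis step in action on concrete numbers, here is a small Python sketch. It is illustrative only: the function `extended_gcd` (the standard extended Euclidean algorithm, which the text itself does not present) and the sample values p = 7, $a_1$ = 10, $a_2$ = 21 are our own assumptions. It finds integers k and $\ell$ with $1 = pk + a_1\ell$ and then checks the equation $a_2 = pka_2 + a_1a_2\ell$ from the proof.

```python
def extended_gcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) and g == s*a + t*b, for a, b >= 0."""
    if b == 0:
        return (a, 1, 0)
    g, s, t = extended_gcd(b, a % b)
    return (g, t, s - (a // b) * t)


# Hypothetical sample values: p is prime, p divides a1*a2 = 210, p does not divide a1.
p, a1, a2 = 7, 10, 21
g, k, ell = extended_gcd(p, a1)           # integers with 1 = p*k + a1*ell
assert g == 1 and p * k + a1 * ell == 1

# Multiplying by a2 gives a2 = p*k*a2 + a1*a2*ell; p divides both terms
# on the right, so p divides a2.
assert a2 == p * k * a2 + a1 * a2 * ell
assert (p * k * a2) % p == 0 and (a1 * a2 * ell) % p == 0 and a2 % p == 0
```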


Please test your understanding now by working a few exercises. 




10.2 Proof by Strong Induction 


Sometimes in an induction proof it is hard to show that $S_k$ implies $S_{k+1}$.
It may be easier to show some “lower” $S_m$ (with m < k) implies $S_{k+1}$.
For such situations there is a slight variant of induction called strong
induction. Strong induction works just like regular induction, except that
in Step (2) instead of assuming $S_k$ is true and showing this forces $S_{k+1}$ to
be true, we assume that all the statements $S_1, S_2, \ldots, S_k$ are true and show
this forces $S_{k+1}$ to be true. The idea is that if the first k dominoes falling
always forces the (k + 1)th domino to fall, then all the dominoes must fall.


Outline for Proof by Strong Induction 


Proposition The statements $S_1, S_2, S_3, S_4, \ldots$ are all true.

Proof. (Strong induction)

(1) Prove the first statement $S_1$. (Or the first several $S_n$, if needed.)
(2) Given any integer $k \ge 1$, prove $(S_1 \wedge S_2 \wedge S_3 \wedge \cdots \wedge S_k) \Rightarrow S_{k+1}$. ∎


This is useful when $S_k$ does not easily imply $S_{k+1}$. You may be better served
by showing some earlier statement ($S_{k-1}$ or $S_{k-2}$, for instance) implies $S_{k+1}$.
In strong induction you can use any (or all) of $S_1, S_2, \ldots, S_k$ to prove $S_{k+1}$.
Here is a classic first example of a strong induction proof: The problem
is to prove that you can achieve any postage of 8 cents or more, exactly,
using only 3¢ and 5¢ stamps. For example, for a postage of 47 cents, you
could use nine 3¢ stamps and four 5¢ stamps. Let $S_n$ be the statement
$S_n$: You can get a postage of exactly n¢ using only 3¢ and 5¢ stamps. Thus
we need to prove all the statements $S_8, S_9, S_{10}, S_{11}, \ldots$ are true. In the proof,
to show $S_{k+1}$ is true we will need to “go back” three steps from $S_{k+1}$, so the
basis step involves verifying the first three statements $S_8$, $S_9$ and $S_{10}$.


Proposition Any postage of 8¢ or more is possible using 3¢ and 5¢ stamps. 


Proof. We will use strong induction.
(1) This holds for postages of 8, 9 and 10 cents: For 8¢, use one 3¢ stamp
and one 5¢ stamp. For 9¢, three 3¢ stamps. For 10¢, two 5¢ stamps.

(2) Let $k \ge 10$, and for each $8 \le m \le k$, assume a postage of m cents can be
obtained exactly with 3¢ and 5¢ stamps. (That is, assume statements
$S_8, S_9, \ldots, S_k$ are all true.) We must show that $S_{k+1}$ is true, that is,
(k+1)-cents postage can be achieved with 3¢ and 5¢ stamps. By assumption,
$S_{k-2}$ is true. Thus we can get (k−2)-cents postage with 3¢ and 5¢ stamps.
Now just add one more 3¢ stamp, and we have (k − 2) + 3 = k + 1 cents
postage with 3¢ and 5¢ stamps. ∎
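This strong induction proof translates directly into a recursive procedure. The Python sketch below is illustrative only (the function name `stamps` and its return format are our own choices): the three basis cases are stored in a table, and every larger postage is obtained by "going back" three cents and adding one 3¢ stamp, exactly as in step (2).

```python
def stamps(n):
    """Return (threes, fives) with 3*threes + 5*fives == n, for an integer n >= 8."""
    if n < 8:
        raise ValueError("the proposition only covers postage of 8 cents or more")
    basis = {8: (1, 1), 9: (3, 0), 10: (0, 2)}   # basis step: S_8, S_9, S_10
    if n in basis:
        return basis[n]
    threes, fives = stamps(n - 3)                # inductive step: use S_(n-3) ...
    return (threes + 1, fives)                   # ... and add one more 3-cent stamp


threes, fives = stamps(47)
assert 3 * threes + 5 * fives == 47
```

The combination returned for 47 cents need not be the nine 3¢ and four 5¢ stamps mentioned above; the proposition claims only that some combination exists.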




Our next example proves that $12 \mid (n^4 - n^2)$ for any $n \in \mathbb{N}$. But first, let’s see
how regular induction is problematic. Regular induction starts by checking
$12 \mid (n^4 - n^2)$ for n = 1. This reduces to $12 \mid 0$, which is true. Next we assume
$12 \mid (k^4 - k^2)$ and try to show that this implies $12 \mid ((k+1)^4 - (k+1)^2)$. Now,
$12 \mid (k^4 - k^2)$ means $k^4 - k^2 = 12a$ for some $a \in \mathbb{Z}$. We want to use this to get
$(k+1)^4 - (k+1)^2 = 12b$ for some integer b. Working it out,

\begin{align*}
(k+1)^4 - (k+1)^2 &= (k^4 + 4k^3 + 6k^2 + 4k + 1) - (k^2 + 2k + 1) \\
&= (k^4 - k^2) + 4k^3 + 6k^2 + 2k \\
&= 12a + 4k^3 + 6k^2 + 2k.
\end{align*}

At this point we’re stuck because we can’t factor out a 12.

Let’s try strong induction. Say $S_n$ is the statement $S_n: 12 \mid (n^4 - n^2)$.
In strong induction, we assume each of $S_1, S_2, \ldots, S_k$ is true, and show that
this makes $S_{k+1}$ true. In particular, if $S_1$ through $S_k$ are true, then $S_{k-5}$ is
true, provided $k - 5 \ge 1$. We will show $S_{k-5} \Rightarrow S_{k+1}$ instead of $S_k \Rightarrow S_{k+1}$.
But as $k - 5 \ge 1$, we have $k \ge 6$. Thus our basis step must check that
$S_1, S_2, S_3, S_4, S_5, S_6$ are all true. Once this is done, $S_{k-5} \Rightarrow S_{k+1}$ will imply
that any other $S_k$ is true. For example, if k = 6, then $S_{k-5} \Rightarrow S_{k+1}$ is $S_1 \Rightarrow S_7$,
so $S_7$ is true. If k = 7, then $S_{k-5} \Rightarrow S_{k+1}$ is $S_2 \Rightarrow S_8$, so $S_8$ is true, etc.


Proposition If $n \in \mathbb{N}$, then $12 \mid (n^4 - n^2)$.


Proof. We will prove this with strong induction.
(1) First note that the statement is true for the first six positive integers:
For n = 1, 12 divides $1^4 - 1^2 = 0$.
For n = 2, 12 divides $2^4 - 2^2 = 12$.
For n = 3, 12 divides $3^4 - 3^2 = 72$.
For n = 4, 12 divides $4^4 - 4^2 = 240$.
For n = 5, 12 divides $5^4 - 5^2 = 600$.
For n = 6, 12 divides $6^4 - 6^2 = 1260$.
(2) For $k \ge 6$, assume $12 \mid (m^4 - m^2)$ for $1 \le m \le k$ (i.e., $S_1, S_2, \ldots, S_k$ are true).
We must show $S_{k+1}$ is true, that is, $12 \mid ((k+1)^4 - (k+1)^2)$. Now, $S_{k-5}$
being true means $12 \mid ((k-5)^4 - (k-5)^2)$. To simplify, put $k - 5 = \ell$ so
$12 \mid (\ell^4 - \ell^2)$, meaning $\ell^4 - \ell^2 = 12a$ for $a \in \mathbb{Z}$, and $k + 1 = \ell + 6$. Then:

\begin{align*}
(k+1)^4 - (k+1)^2 &= (\ell+6)^4 - (\ell+6)^2 \\
&= \ell^4 + 24\ell^3 + 216\ell^2 + 864\ell + 1296 - (\ell^2 + 12\ell + 36) \\
&= (\ell^4 - \ell^2) + 24\ell^3 + 216\ell^2 + 852\ell + 1260 \\
&= 12a + 24\ell^3 + 216\ell^2 + 852\ell + 1260 \\
&= 12(a + 2\ell^3 + 18\ell^2 + 71\ell + 105).
\end{align*}

Because $(a + 2\ell^3 + 18\ell^2 + 71\ell + 105) \in \mathbb{Z}$, we get $12 \mid ((k+1)^4 - (k+1)^2)$. ∎
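The arithmetic in this proof is a natural place for slips, so it may be worth checking by machine. The Python sketch below is only an illustration (the helper name `f` is our own, standing for $n^4 - n^2$): it verifies the six basis cases and tests the expansion used in the inductive step for many values of $\ell$.

```python
def f(n):
    """The quantity n^4 - n^2 from the proposition."""
    return n**4 - n**2


# Basis step: S_1 through S_6.
assert [f(n) for n in range(1, 7)] == [0, 12, 72, 240, 600, 1260]
assert all(f(n) % 12 == 0 for n in range(1, 7))

# Inductive step: with k + 1 = ell + 6, the proof rewrites f(ell + 6) as
# f(ell) + 12*(2*ell^3 + 18*ell^2 + 71*ell + 105).
for ell in range(1, 500):
    assert f(ell + 6) == f(ell) + 12 * (2 * ell**3 + 18 * ell**2 + 71 * ell + 105)
```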




Our next example involves mathematical objects called graphs. The 
word graph has two meanings in mathematics. First, there are the graphs 
of equations and functions from algebra and calculus. But here we will be 
interested in the second meaning: A graph is a configuration consisting of 
points (called vertices) and edges, which are lines connecting the vertices. 
Following are pictures of four graphs. Let’s agree that all of our graphs will 
be in “one piece,” that is, you can travel from any vertex of a graph to any 
other vertex by traversing a route of edges from one vertex to the other. 


Figure 10.1. Examples of Graphs


A cycle in a graph is a sequence of distinct edges in the graph that form
a route that ends where it began. For example, the graph on the far left
of Figure 10.1 has a cycle that starts at vertex $v_1$, then goes to $v_2$, then to
$v_3$, then $v_4$ and finally back to its starting point $v_1$. You can find cycles in
both of the graphs on the left, but the two graphs on the right do not have
cycles. There is a special name for a graph that has no cycles; it is called a
tree. Thus the two graphs on the right of Figure 10.1 are trees, but the two
graphs on the left are not trees. Note that a single vertex • has no cycle, so
it is a tree (with one vertex and zero edges).

The two trees in Figure 10.1 both have one fewer edge than vertex. The 
tree on the far right has 5 vertices and 4 edges. The one next to it has 6 
vertices and 5 edges. Draw any tree (like the one in Figure 10.2). If it has n 
vertices, then it will have n—1 edges. We now prove that this is always true. 


Figure 10.2. A tree 


Our proof will use the following observation: If we remove an edge from 
a tree (but leave its two endpoints), then the tree is cut into two separate 
graphs, each a tree, and each smaller than the tree we began with. 




Proposition If a tree has n vertices, then it has n − 1 edges.


Proof. Notice that this theorem asserts that for any $n \in \mathbb{N}$, the following
statement is true: $S_n$: A tree with n vertices has n − 1 edges. We use strong
induction to prove this.

(1) Observe that if a tree has n = 1 vertex then it has no edges. Thus it has
n − 1 = 0 edges, so the theorem is true when n = 1. (That is, $S_1$ is true.)

(2) Now take an integer $k \ge 1$. We must show $(S_1 \wedge S_2 \wedge \cdots \wedge S_k) \Rightarrow S_{k+1}$. In
words, we must show that if it is true that any tree with m vertices
has m − 1 edges, where $1 \le m \le k$, then any tree with k + 1 vertices has
(k + 1) − 1 = k edges. We will use direct proof.

Suppose that for each m with $1 \le m \le k$, any tree with m vertices has
m − 1 edges. Let T be a tree with k + 1 vertices. We must show that T
has (k + 1) − 1 edges. Pick an edge of T and label it e, as shown below.

[Figure: a tree T with an edge labeled e.]

Now remove the edge e from T, but leave the two endpoints of e. This
leaves two smaller trees that we call $T_1$ and $T_2$. Let’s say $T_1$ has x
vertices and $T_2$ has y vertices. As each of these two smaller trees has
fewer than k + 1 vertices, our inductive hypothesis guarantees that $T_1$
has x − 1 edges, and $T_2$ has y − 1 edges. Think about our original tree T.
It has x + y vertices. It has x − 1 edges that belong to $T_1$ and y − 1 edges
that belong to $T_2$, plus it has the additional edge e that belongs to
neither $T_1$ nor $T_2$. Thus, all together, the number of edges that T has
is (x − 1) + (y − 1) + 1 = (x + y) − 1. In other words, T has one fewer edges
than it has vertices. Thus T has k + 1 vertices and (k + 1) − 1 = k edges.

It follows by strong induction that a tree with n vertices has n − 1 edges. ∎
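The splitting argument can be mimicked in code. The following Python sketch is illustrative and rests on our own assumptions: a tree is represented by a set of vertices together with a list of edges (pairs of vertices), and the names `component` and `count_edges` are hypothetical. The function removes an edge e, finds the two smaller trees $T_1$ and $T_2$, and adds up (x − 1) + (y − 1) + 1 just as the proof does. It assumes its input really is a tree.

```python
def component(start, edges):
    """Return the set of vertices reachable from `start` along `edges`."""
    seen, stack = {start}, [start]
    while stack:
        v = stack.pop()
        for a, b in edges:
            if a == v and b not in seen:
                seen.add(b)
                stack.append(b)
            elif b == v and a not in seen:
                seen.add(a)
                stack.append(a)
    return seen


def count_edges(vertices, edges):
    """Count the edges of a tree by splitting it at an edge, as in the proof."""
    if len(vertices) == 1:
        return 0                                   # basis step: one vertex, no edges
    e, rest = edges[0], edges[1:]                  # pick an edge e and remove it
    v1 = component(e[0], rest)                     # vertices of T_1
    v2 = vertices - v1                             # vertices of T_2
    e1 = [edge for edge in rest if edge[0] in v1]  # edges of T_1
    e2 = [edge for edge in rest if edge[0] in v2]  # edges of T_2
    return count_edges(v1, e1) + count_edges(v2, e2) + 1   # (x-1) + (y-1) + 1


# A hypothetical tree with 6 vertices.
tree_vertices = {1, 2, 3, 4, 5, 6}
tree_edges = [(1, 2), (2, 3), (2, 4), (4, 5), (4, 6)]
assert count_edges(tree_vertices, tree_edges) == len(tree_vertices) - 1
```

Because the two pieces usually have different sizes, the recursion calls itself on trees of many smaller sizes, which is the computational counterpart of needing strong induction here.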


Notice that it was absolutely essential that we used strong induction
in the above proof because the two trees $T_1$ and $T_2$ will not both have k
vertices. At least one will have fewer than k vertices. Thus the statement
$S_k$ is not enough to imply $S_{k+1}$. We need to use the assumption that $S_m$ will
be true whenever $m \le k$, and strong induction allows us to do this.




10.3 Proof by Smallest Counterexample 


This section introduces yet another proof technique, called proof by
smallest counterexample. It is a hybrid of induction and proof by contradiction.
It has the nice feature that it leads you straight to a contradiction. It is
therefore more “automatic” than the proof by contradiction that was introduced
in Chapter 6. Here is the outline:


Outline for Proof by Smallest Counterexample 


Proposition The statements $S_1, S_2, S_3, S_4, \ldots$ are all true.


Proof. (Smallest counterexample)
(1) Check that the first statement $S_1$ is true.
(2) For the sake of contradiction, suppose not every $S_n$ is true.
(3) Let k > 1 be the smallest integer for which $S_k$ is false.
(4) Then $S_{k-1}$ is true and $S_k$ is false. Use this to get a contradiction.


This setup leads you to a point where $S_{k-1}$ is true and $S_k$ is false. It is
here, where true and false collide, that you will find a contradiction. Let’s
do an example.


Proposition If $n \in \mathbb{N}$, then $4 \mid (5^n - 1)$.


Proof. We use proof by smallest counterexample. (We will number the steps
to match the outline, but that is not usually done in practice.)

(1) If n = 1, then the statement is $4 \mid (5^1 - 1)$, or $4 \mid 4$, which is true.
(2) For sake of contradiction, suppose it’s not true that $4 \mid (5^n - 1)$ for all n.
(3) Let k > 1 be the smallest integer for which $4 \nmid (5^k - 1)$.
(4) Then $4 \mid (5^{k-1} - 1)$, so there is an integer a for which $5^{k-1} - 1 = 4a$. Then

\begin{align*}
5^{k-1} - 1 &= 4a \\
5(5^{k-1} - 1) &= 5 \cdot 4a \\
5^k - 5 &= 20a \\
5^k - 1 &= 20a + 4 \\
5^k - 1 &= 4(5a + 1).
\end{align*}

This means $4 \mid (5^k - 1)$, a contradiction, because $4 \nmid (5^k - 1)$ in Step 3. Thus,
we were wrong in Step 2 to assume that it is untrue that $4 \mid (5^n - 1)$ for
every n. Therefore $4 \mid (5^n - 1)$ is true for every n. ∎
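The idea of a smallest counterexample also has a simple computational reading: scan n = 1, 2, 3, … and stop at the first n for which the statement fails. The Python sketch below is illustrative only (the function name `smallest_counterexample` and the search limit are our own choices). A search that finds nothing up to a limit proves nothing by itself, but it agrees with the proposition, and the second check confirms the algebra of step (4).

```python
def smallest_counterexample(statement, limit):
    """Return the smallest n in 1..limit for which statement(n) is false, or None."""
    for n in range(1, limit + 1):
        if not statement(n):
            return n
    return None


# No counterexample to 4 | (5^n - 1) appears for n up to 500.
assert smallest_counterexample(lambda n: (5**n - 1) % 4 == 0, 500) is None

# The algebra in step (4): 5^k - 1 = 5(5^(k-1) - 1) + 4.
assert all(5**k - 1 == 5 * (5 ** (k - 1) - 1) + 4 for k in range(2, 500))
```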




10.4 The Fundamental Theorem of Arithmetic 


The fundamental theorem of arithmetic states that any integer greater 
than 1 has a unique prime factorization. For example, 12 factors into primes 
as 12=2-2-3, and moreover any factorization of 12 into primes uses exactly 
the primes 2, 2 and 3. Our proof combines the techniques of induction, 
cases, minimum counterexample and the idea of uniqueness of existence 
outlined at the end of Section 7.3. 


Theorem 10.1 (Fundamental Theorem of Arithmetic) Any integer n > 1
has a unique prime factorization. “Unique” means that if $n = p_1 \cdot p_2 \cdot p_3 \cdots p_k$
and $n = a_1 \cdot a_2 \cdot a_3 \cdots a_\ell$ are two prime factorizations of n, then $k = \ell$, and the
primes $p_i$ and $a_i$ are the same, except that they may be in a different order.


Proof. Suppose n > 1. We first use strong induction to show that n has a
prime factorization. For the basis step, if n = 2, it is prime, so it is already
its own prime factorization. Let $n \ge 2$ and assume every integer between 2
and n (inclusive) has a prime factorization. Consider n + 1. If it is prime,
then it is its own prime factorization. If it is not prime, then it factors as
n + 1 = ab with a, b > 1. Because a and b are both less than n + 1 they have
prime factorizations $a = p_1 \cdot p_2 \cdot p_3 \cdots p_k$ and $b = p'_1 \cdot p'_2 \cdot p'_3 \cdots p'_\ell$. Then

$$n + 1 = ab = (p_1 \cdot p_2 \cdot p_3 \cdots p_k)(p'_1 \cdot p'_2 \cdot p'_3 \cdots p'_\ell)$$

is a prime factorization of n + 1. This completes the proof by strong induction
that every integer greater than 1 has a prime factorization.

Next we use proof by smallest counterexample to prove that the prime
factorization of any $n \ge 2$ is unique. If n = 2, then n clearly has only one
prime factorization, namely itself. Assume for the sake of contradiction that
there is an n > 2 that has different prime factorizations $n = p_1 \cdot p_2 \cdot p_3 \cdots p_k$
and $n = a_1 \cdot a_2 \cdot a_3 \cdots a_\ell$. Assume n is the smallest number with this property.
From $n = p_1 \cdot p_2 \cdot p_3 \cdots p_k$, we see that $p_1 \mid n$, so $p_1 \mid (a_1 \cdot a_2 \cdot a_3 \cdots a_\ell)$. By
Proposition 10.1 (page 186), $p_1$ divides one of the primes $a_i$. As $a_i$ is prime,
we have $p_1 = a_i$. Dividing $n = p_1 \cdot p_2 \cdot p_3 \cdots p_k = a_1 \cdot a_2 \cdot a_3 \cdots a_\ell$ by $p_1 = a_i$
yields

$$p_2 \cdot p_3 \cdots p_k = a_1 \cdot a_2 \cdot a_3 \cdots a_{i-1} \cdot a_{i+1} \cdots a_\ell.$$

These two factorizations are different, because the two prime factorizations
of n were different. (Remember: the primes $p_1$ and $a_i$ are equal, so the
difference appears in the remaining factors, displayed above.) But also the
above number $p_2 \cdot p_3 \cdots p_k$ is smaller than n, and this contradicts the fact
that n was the smallest number with two different prime factorizations. ∎
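The existence half of this proof is effectively a recursive algorithm. The Python sketch below is an illustration under our own assumptions (the function name `prime_factorization` and the trial-division search for a factor are not from the text): if n has no proper divisor it is its own factorization, and otherwise n = ab with 1 < a, b < n, and the factorizations of a and b are combined.

```python
from math import isqrt


def prime_factorization(n):
    """Return a list of primes whose product is n, for an integer n > 1."""
    if n <= 1:
        raise ValueError("n must be greater than 1")
    for a in range(2, isqrt(n) + 1):
        if n % a == 0:                     # n = a*b with 1 < a, b < n
            b = n // a
            return prime_factorization(a) + prime_factorization(b)
    return [n]                             # no proper divisor: n is prime


assert sorted(prime_factorization(12)) == [2, 2, 3]
```

The recursive calls are made on a and b, which may be much smaller than n − 1; this is why the proof uses strong induction rather than the basic form.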




A word of caution about induction and proof by smallest counterexample: 
In proofs in other textbooks or in mathematical papers, it often happens 
that the writer doesn’t tell you up front that these techniques are being 
used. Instead, you will have to read through the proof to glean from context 
what technique is being used. In fact, the same warning applies to all of our 
proof techniques. If you continue with mathematics, you will gradually gain 
through experience the ability to analyze a proof and understand exactly 
what approach is being used when it is not stated explicitly. Frustrations 
await you, but do not be discouraged by them. Frustration is a natural part 
of anything that’s worth doing. 


10.5 Fibonacci Numbers 


Leonardo Pisano, now known as Fibonacci, was a mathematician born 
around 1175 in what is now Italy. His most significant work was a book 
Liber Abaci, which is recognized as a catalyst in medieval Europe’s slow 
transition from Roman numbers to the Hindu-Arabic number system. But 
he is best known today for a number sequence that he described in his book 
and that bears his name. The Fibonacci sequence is 


1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377,...


The numbers that appear in this sequence are called Fibonacci numbers. 
The first two numbers are 1 and 1, and thereafter any entry is the sum 
of the previous two entries. For example 3 + 5 = 8, and 5 + 8 = 13, etc. We
denote the nth term of this sequence as $F_n$. Thus $F_1 = 1$, $F_2 = 1$, $F_3 = 2$,
$F_4 = 3$, $F_7 = 13$ and so on. Notice that the Fibonacci sequence is entirely
determined by the rules $F_1 = 1$, $F_2 = 1$, and $F_n = F_{n-1} + F_{n-2}$.

We introduce Fibonacci’s sequence here partly because it is something 
everyone should know about, but also because it is a great source of induction 
problems. This sequence, which appears with surprising frequency in 
nature, is filled with mysterious patterns and hidden structures. Some of 
these structures will be revealed in the examples and exercises. 

We emphasize that the condition $F_n = F_{n-1} + F_{n-2}$ (or equivalently
$F_{n+1} = F_n + F_{n-1}$) is the perfect setup for induction. It suggests that we can
determine something about $F_n$ by looking at earlier terms of the sequence.
In using induction to prove something about the Fibonacci sequence, you
should expect to use the equation $F_n = F_{n-1} + F_{n-2}$ somewhere.

For our first example we will prove that $F_{n+1}^2 - F_{n+1}F_n - F_n^2 = (-1)^n$ for any
natural number n. For example, if n = 5 we have $F_6^2 - F_6 F_5 - F_5^2 = 8^2 - 8 \cdot 5 - 5^2 =
64 - 40 - 25 = -1 = (-1)^5$.




Proposition The Fibonacci sequence obeys F?, , — FniiFn-F?=(-1)". 


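Proof. We will prove this with mathematical induction.

(1) If $n = 1$ we have $F_{n+1}^2 - F_{n+1}F_n - F_n^2 = F_2^2 - F_2F_1 - F_1^2 = 1^2 - 1\cdot 1 - 1^2 = -1 = (-1)^1 = (-1)^n$,
so indeed $F_{n+1}^2 - F_{n+1}F_n - F_n^2 = (-1)^n$ is true when $n = 1$.

(2) Let $k \in \mathbb{N}$. Using direct proof, we will show $F_{k+1}^2 - F_{k+1}F_k - F_k^2 = (-1)^k$
implies $F_{k+2}^2 - F_{k+2}F_{k+1} - F_{k+1}^2 = (-1)^{k+1}$. Say $F_{k+1}^2 - F_{k+1}F_k - F_k^2 = (-1)^k$.
Next we work out $F_{k+2}^2 - F_{k+2}F_{k+1} - F_{k+1}^2$ and show that it equals $(-1)^{k+1}$.
In doing this we will use the fact $F_{k+2} = F_{k+1} + F_k$. Observe that

$$\begin{aligned}
F_{k+2}^2 - F_{k+2}F_{k+1} - F_{k+1}^2 &= (F_{k+1} + F_k)^2 - (F_{k+1} + F_k)F_{k+1} - F_{k+1}^2 \\
&= F_{k+1}^2 + 2F_{k+1}F_k + F_k^2 - F_{k+1}^2 - F_kF_{k+1} - F_{k+1}^2 \\
&= -F_{k+1}^2 + F_{k+1}F_k + F_k^2 \\
&= -\left(F_{k+1}^2 - F_{k+1}F_k - F_k^2\right) \\
&= -(-1)^k \qquad \text{(inductive hypothesis)} \\
&= (-1)^1(-1)^k \\
&= (-1)^{k+1}.
\end{aligned}$$

Therefore $F_{k+2}^2 - F_{k+2}F_{k+1} - F_{k+1}^2 = (-1)^{k+1}$.

It follows by induction that $F_{n+1}^2 - F_{n+1}F_n - F_n^2 = (-1)^n$ for every $n \in \mathbb{N}$. ∎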
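A finite computation is no substitute for the induction above, but it is a reasonable sanity check on the statement. The sketch below tests the identity for the first 200 values of $n$ using exact integer arithmetic. The names `fib_pairs` and `identity_holds` are ours, chosen only for this illustration.

```python
def fib_pairs(limit):
    """Yield (n, F_n, F_{n+1}) for n = 1, 2, ..., limit."""
    a, b = 1, 1  # F_1 and F_2
    for n in range(1, limit + 1):
        yield n, a, b
        a, b = b, a + b

def identity_holds(limit):
    """Check F_{n+1}^2 - F_{n+1} F_n - F_n^2 == (-1)^n for n = 1, ..., limit."""
    return all(b * b - b * a - a * a == (-1) ** n
               for n, a, b in fib_pairs(limit))

print(identity_holds(200))
```

Python integers have arbitrary precision, so the check is exact even when the Fibonacci numbers become very large.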


Let’s pause for a moment and think about what the result we just proved
means. Dividing both sides of $F_{n+1}^2 - F_{n+1}F_n - F_n^2 = (-1)^n$ by $F_n^2$ gives

$$\left(\frac{F_{n+1}}{F_n}\right)^2 - \frac{F_{n+1}}{F_n} - 1 = \frac{(-1)^n}{F_n^2}.$$

For large values of $n$, the right-hand side is very close to zero, and the
left-hand side is $F_{n+1}/F_n$ plugged into the polynomial $x^2 - x - 1$. Thus, as
$n$ increases, the ratio $F_{n+1}/F_n$ approaches a root of $x^2 - x - 1 = 0$. By the
quadratic formula, the roots of $x^2 - x - 1$ are $\frac{1 \pm \sqrt{5}}{2}$. As $F_{n+1}/F_n > 1$, this ratio
must be approaching the positive root $\frac{1+\sqrt{5}}{2}$. Therefore

$$\lim_{n\to\infty} \frac{F_{n+1}}{F_n} = \frac{1+\sqrt{5}}{2}. \tag{10.1}$$

For a quick spot check, note that $F_{14}/F_{13} \approx 1.618025$, while $\frac{1+\sqrt{5}}{2} \approx 1.618033$.
Even for the small value $n = 13$, the numbers match to four decimal places.
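The spot check is easy to repeat for other values of $n$. The following sketch computes the ratio of consecutive Fibonacci numbers and its distance from the positive root of $x^2 - x - 1$. The function name `ratio` and the constant name `PHI` are ours, and the comparison is done in floating point, so it is only approximate.

```python
import math

PHI = (1 + math.sqrt(5)) / 2  # the positive root of x^2 - x - 1

def ratio(n):
    """Return F_{n+1} / F_n as a float."""
    a, b = 1, 1  # F_1 and F_2
    for _ in range(n - 1):
        a, b = b, a + b
    return b / a

for n in (1, 5, 13, 30):
    print(n, ratio(n), abs(ratio(n) - PHI))
```

The printed gap shrinks quickly as $n$ grows, in line with the right-hand side $(-1)^n / F_n^2$ tending to zero.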




The number $\Phi = \frac{1+\sqrt{5}}{2}$ is sometimes called the golden ratio, and there
has been much speculation about its occurrence in nature as well as in 
classical art and architecture. One theory holds that the Parthenon and the 
Great Pyramids of Egypt were designed in accordance with this number. 

But we are here concerned with things that can be proved. We close by 
observing how the Fibonacci sequence in many ways resembles a geometric 
sequence. Recall that a geometric sequence with first term a and common 
ratio r has the form 


$a,\ ar,\ ar^2,\ ar^3,\ ar^4,\ ar^5,\ ar^6,\ ar^7,\ ar^8, \ldots$


where any term is obtained by multiplying the previous term by $r$. In
general its $n$th term (counting the first term as $G_0$) is $G_n = ar^n$, and $G_{n+1}/G_n = r$.
Equation (10.1) tells us that $F_{n+1}/F_n \approx \Phi$. Thus even though it is not a geometric
sequence, the Fibonacci sequence tends to behave like a geometric sequence with common
ratio $\Phi$, and the further “out” you go, the closer the resemblance.


Exercises for Chapter 10 
Prove the following statements with either induction, strong induction or proof by 
smallest counterexample. 


1. Prove that $1+2+3+4+\cdots+n = \frac{n^2+n}{2}$ for every positive integer $n$.

2. Prove that $1^2+2^2+3^2+4^2+\cdots+n^2 = \frac{n(n+1)(2n+1)}{6}$ for every positive integer $n$.

3. Prove that $1^3+2^3+3^3+4^3+\cdots+n^3 = \frac{n^2(n+1)^2}{4}$ for every positive integer $n$.

4. If $n \in \mathbb{N}$, then $1\cdot 2+2\cdot 3+3\cdot 4+4\cdot 5+\cdots+n(n+1) = \frac{n(n+1)(n+2)}{3}$.

5. If $n \in \mathbb{N}$, then $2^1+2^2+2^3+\cdots+2^n = 2^{n+1}-2$.

6. Prove that $\sum_{i=1}^{n}(8i-5) = 4n^2 - n$ for every positive integer $n$.

7. If $n \in \mathbb{N}$, then $1\cdot 3+2\cdot 4+3\cdot 5+4\cdot 6+\cdots+n(n+2) = \frac{n(n+1)(2n+7)}{6}$.

8. If $n \in \mathbb{N}$, then $\frac{1}{2!}+\frac{2}{3!}+\frac{3}{4!}+\cdots+\frac{n}{(n+1)!} = 1-\frac{1}{(n+1)!}$.

9. Prove that $24 \mid (5^{2n}-1)$ for every integer $n \ge 0$.

10. Prove that $3 \mid (5^{2n}-1)$ for every integer $n \ge 0$.

11. Prove that $3 \mid (n^3+5n+6)$ for every integer $n \ge 0$.

12. Prove that $9 \mid (4^{3n}+8)$ for every integer $n \ge 0$.

13. Prove that $6 \mid (n^3-n)$ for every integer $n \ge 0$.




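14. Suppose $a \in \mathbb{Z}$. Prove that $5 \mid 2^n a$ implies $5 \mid a$ for any $n \in \mathbb{N}$.

15. If $n \in \mathbb{N}$, then $\frac{1}{1\cdot 2}+\frac{1}{2\cdot 3}+\frac{1}{3\cdot 4}+\frac{1}{4\cdot 5}+\cdots+\frac{1}{n(n+1)} = 1-\frac{1}{n+1}$.

16. Prove that $2^n + 1 \le 3^n$ for every positive integer $n$.

17. Suppose $A_1, A_2, \ldots, A_n$ are sets in some universal set $U$, and $n \ge 2$. Prove that
$\overline{A_1 \cap A_2 \cap \cdots \cap A_n} = \overline{A_1} \cup \overline{A_2} \cup \cdots \cup \overline{A_n}$.

18. Suppose $A_1, A_2, \ldots, A_n$ are sets in some universal set $U$, and $n \ge 2$. Prove that
$\overline{A_1 \cup A_2 \cup \cdots \cup A_n} = \overline{A_1} \cap \overline{A_2} \cap \cdots \cap \overline{A_n}$.

19. Prove that $\frac{1}{1}+\frac{1}{4}+\frac{1}{9}+\cdots+\frac{1}{n^2} \le 2-\frac{1}{n}$ for every $n \in \mathbb{N}$.

20. Prove that $(1+2+3+\cdots+n)^2 = 1^3+2^3+3^3+\cdots+n^3$ for every $n \in \mathbb{N}$.

21. If $n \in \mathbb{N}$, then $\frac{1}{1}+\frac{1}{2}+\frac{1}{3}+\frac{1}{4}+\frac{1}{5}+\cdots+\frac{1}{2^n-1}+\frac{1}{2^n} \ge 1+\frac{n}{2}$.
(Note: This problem asserts that the sum of the first $2^n$ terms of the harmonic
series is at least $1 + n/2$. It thus implies that the harmonic series diverges.)

22. If $n \in \mathbb{N}$, then $\left(1-\frac{1}{2}\right)\left(1-\frac{1}{4}\right)\left(1-\frac{1}{8}\right)\left(1-\frac{1}{16}\right)\cdots\left(1-\frac{1}{2^n}\right) \ge \frac{1}{4}+\frac{1}{2^{n+1}}$.

23. Use mathematical induction to prove the binomial theorem (Theorem 3.1 on
page 92). You may find that you need Equation (3.3) on page 90.

24. Prove that $\sum_{k=1}^{n} k\binom{n}{k} = n2^{n-1}$ for each natural number $n$.

25. Concerning the Fibonacci sequence, prove that $F_1+F_2+F_3+F_4+\cdots+F_n = F_{n+2}-1$.

26. Concerning the Fibonacci sequence, prove that $\sum_{k=1}^{n} F_k^2 = F_nF_{n+1}$.

27. Concerning the Fibonacci sequence, prove that $F_1+F_3+F_5+F_7+\cdots+F_{2n-1} = F_{2n}$.

28. Concerning the Fibonacci sequence, prove that $F_2+F_4+F_6+F_8+\cdots+F_{2n} = F_{2n+1}-1$.

29. The indicated diagonals of Pascal’s triangle sum to Fibonacci numbers. Prove
that this pattern continues forever.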




30. Here $F_n$ is the $n$th Fibonacci number. Prove that
$$F_n = \frac{\left(\frac{1+\sqrt{5}}{2}\right)^n - \left(\frac{1-\sqrt{5}}{2}\right)^n}{\sqrt{5}}.$$

31. Prove that $\sum_{k=0}^{n}\binom{k}{r} = \binom{n+1}{r+1}$, where $1 \le r \le n$.

32. Prove that the number of $n$-digit binary numbers that have no consecutive 1’s is
the Fibonacci number $F_{n+2}$. For example, for $n = 2$ there are three such numbers
(00, 01, and 10), and $3 = F_{2+2} = F_4$. Also, for $n = 3$ there are five such numbers
(000, 001, 010, 100, 101), and $5 = F_{3+2} = F_5$.

33. Suppose $n$ (infinitely long) straight lines lie on a plane in such a way that no
two of the lines are parallel, and no three of the lines intersect at a single point.
Show that this arrangement divides the plane into $\frac{n^2+n+2}{2}$ regions.

34. Prove that $3^1+3^2+3^3+3^4+\cdots+3^n = \frac{3^{n+1}-3}{2}$ for every $n \in \mathbb{N}$.

35. Prove that if $n, k \in \mathbb{N}$, and $n$ is even and $k$ is odd, then $\binom{n}{k}$ is even.

36. Prove that if $n = 2^k - 1$ for some $k \in \mathbb{N}$, then every entry in the $n$th row of Pascal’s
triangle is odd.

37. Prove that if $m, n \in \mathbb{N}$, then $\sum_{k=0}^{n} k\binom{m+k}{m} = n\binom{m+n+1}{m+1} - \binom{m+n+1}{m+2}$.

38. Prove that $\sum_{k=0}^{p}\binom{m}{k}\binom{n}{p-k} = \binom{m+n}{p}$ for non-negative integers $m$, $n$ and $p$.
(This equation is from Exercise 7 in Section 3.10. There we were asked to prove
it by combinatorial proof. Here we are asked to prove it with induction.)

39. Prove that $\sum_{k=0}^{m}\binom{m}{k}\binom{n}{p+k} = \binom{m+n}{m+p}$ for non-negative integers $m$, $n$ and $p$.
(This equation is from Exercise 8 in Section 3.10. There we were asked to prove
it by combinatorial proof. Here we are asked to prove it with induction.)

40. Use Exercise 38 above to prove that if $n \in \mathbb{N}$, then $\binom{n}{0}^2+\binom{n}{1}^2+\binom{n}{2}^2+\cdots+\binom{n}{n}^2 = \binom{2n}{n}$.
(Note that this equality was also proved by combinatorial proof in Section 3.10.)

41. If $n$ and $k$ are non-negative integers, then $\binom{n+0}{0}+\binom{n+1}{1}+\binom{n+2}{2}+\cdots+\binom{n+k}{k} = \binom{n+k+1}{k}$.

42. Prove: The $n$th Fibonacci number $F_n$ is even if and only if $3 \mid n$.




Part IV 


Relations, Functions and 
Cardinality 


CHAPTER 11 


Relations 


In mathematics there are endless ways that two entities can be related
to each other. Consider the following mathematical statements. 


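$5 < 10 \qquad 5 \le 5 \qquad 6 = \frac{30}{5} \qquad 5 \mid 80 \qquad 7 > 4 \qquad x \ne y \qquad 8 \nmid 3$
$a \equiv b \pmod{n} \qquad 6 \in \mathbb{Z} \qquad X \subseteq Y \qquad \pi \approx 3.14 \qquad 0 \ge -1 \qquad \sqrt{2} \notin \mathbb{Z} \qquad \mathbb{Z} \not\subseteq \mathbb{N}$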


In each case two entities appear on either side of a symbol, and we
interpret the symbol as expressing some relationship between the two
entities. Symbols such as $<$, $\le$, $=$, $\mid$, $\nmid$, $\ge$, $>$, $\in$ and $\subset$, etc., are called relations
because they convey relationships among things.

Relations are significant. In fact, you would have to admit that there 
would be precious little left of mathematics if we took away all the relations. 
Therefore it is important to have a firm understanding of them, and this 
chapter is intended to develop that understanding. 

Rather than focusing on each relation individually (an impossible task 
anyway since there are infinitely many different relations), we will develop 
a general theory that encompasses all relations. Understanding this 
general theory will give us the conceptual framework and language needed 
to understand and discuss any specific relation. 


11.1 Relations 


Before stating the theoretical definition of a relation, let’s look at a 
motivational example. This example will lead naturally to our definition. 

Consider the set A = {1,2,3,4,5}. (There’s nothing special about this 
particular set; any set of numbers would do for this example.) Elements 
of A can be compared to each other by the symbol “<.” For example, 1 < 4, 
$2 < 3$, $2 < 4$, and so on. You have no trouble understanding this because the
notion of numeric order is so ingrained. But imagine you had to explain it 
to a robot that had an obsession for detail but absolutely no understanding 
of the meaning of (or relationships between) integers. You might consider 
writing down for the robot the following set: 


R = {(1,2), (1,3), (1,4), (1,5), (2,3), (2,4), (2,5), (3,4), (3,5), (4,5) }. 




The set $R$ encodes the meaning of the $<$ relation for elements in $A$. An
ordered pair $(a,b)$ appears in the set if and only if $a < b$. If asked whether or
not it is true that $3 < 4$, the robot could look through $R$ until it found the
ordered pair $(3,4)$; then it would know $3 < 4$ is true. If asked about $5 < 2$, it
would see that $(5,2)$ does not appear in $R$, so $5 \not< 2$. The set $R$, which is a
subset of $A \times A$, completely describes the relation $<$ for $A$.
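The robot’s lookup procedure is easy to imitate in code. In the Python sketch below, the relation is stored as a set of ordered pairs, and a question such as “is 3 < 4?” is answered purely by checking membership. The helper name `relates` is ours; Python’s own `<` is used only once, to write out the table of pairs.

```python
A = {1, 2, 3, 4, 5}

# The relation < on A, written out as a set of ordered pairs (a subset of A x A).
R = {(a, b) for a in A for b in A if a < b}

def relates(relation, x, y):
    """Return True exactly when the ordered pair (x, y) belongs to the relation."""
    return (x, y) in relation

print(sorted(R))
print(relates(R, 3, 4))  # the pair (3, 4) is in R
print(relates(R, 5, 2))  # the pair (5, 2) is not in R
```

Once the table exists, `relates` needs no knowledge of what the pairs mean, which is the point of the robot example.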

It may seem simple-minded at first, but this is the central idea of our 
main definition. This definition is general enough to describe not just the 
relation < for the set A = {1,2,3,4,5}, but any relation for any set A. 


Definition 11.1 A relation on a set $A$ is a subset $R \subseteq A \times A$. We often
abbreviate the statement $(x,y) \in R$ as $xRy$. The statement $(x,y) \notin R$ is
abbreviated as $x \not R y$.


Notice that a relation is a set, so we can use what we know about sets to 
understand and explore relations. But before getting deeper into the theory 
of relations, let’s look at some examples of Definition 11.1. 


Example 11.1 Let $A = \{1,2,3,4\}$, and consider the following set:
$R = \{(1,1), (2,1), (2,2), (3,3), (3,2), (3,1), (4,4), (4,3), (4,2), (4,1)\} \subseteq A \times A$.

The set $R$ is a relation on $A$, by Definition 11.1. Since $(1,1) \in R$, we have
$1R1$. Similarly $2R1$ and $2R2$, and so on. However, notice that (for example)
$(3,4) \notin R$, so $3 \not R 4$. Observe that $R$ is the familiar relation $\ge$ for the set $A$.

Chapter 1 proclaimed that all of mathematics can be described with sets. 
Just look at how successful this program has been! The greater-than-or- 
equal-to relation is now a set R. (We might even express this in the rather 
cryptic form “$\ge\; = R$.”)


Example 11.2 Let $A = \{1,2,3,4\}$, and consider the following set:
$S = \{(1,1), (1,3), (3,1), (3,3), (2,2), (2,4), (4,2), (4,4)\} \subseteq A \times A$.

Here we have $1S1$, $1S3$, $4S2$, etc., but $3 \not S 4$ and $2 \not S 1$. What does $S$ mean?
Think of it as meaning “has the same parity as.” Thus $1S1$ reads “1 has the
same parity as 1,” and $4S2$ reads “4 has the same parity as 2.”


Example 11.3 Consider relations $R$ and $S$ of the previous two examples.

Note that $R \cap S = \{(1,1), (2,2), (3,3), (3,1), (4,4), (4,2)\} \subseteq A \times A$ is a relation on $A$.
The expression $x(R \cap S)y$ means “$x \ge y$, and $x$ has the same parity as $y$.”
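Because relations are sets, ordinary set operations apply to them. The sketch below rebuilds the relations of Examples 11.1 and 11.2 as Python sets of pairs and intersects them; the construction by set comprehension is our own shorthand for the lists given in those examples.

```python
A = {1, 2, 3, 4}

# Example 11.1: x R y means x >= y.
R = {(x, y) for x in A for y in A if x >= y}

# Example 11.2: x S y means x and y have the same parity.
S = {(x, y) for x in A for y in A if (x - y) % 2 == 0}

# The intersection is again a subset of A x A, hence a relation on A.
print(sorted(R & S))
```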




Example 11.4 Let $B = \{0,1,2,3,4,5\}$, and consider the following set:
$U = \{(1,3), (3,3), (5,2), (2,5), (4,2)\} \subseteq B \times B$.

Then $U$ is a relation on $B$ because $U \subseteq B \times B$. You may be hard-pressed to
invent any “meaning” for this particular relation. A relation does not have 
to have any meaning. Any random subset of Bx B is a relation on B, whether 
or not it describes anything familiar. 


Some relations can be described with pictures. For example, we can 
depict the above relation U on B by drawing points labeled by elements of B. 
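The statement $(x,y) \in U$ is then represented by an arrow pointing from $x$ to $y$,
a graphic symbol meaning “$x$ relates to $y$.” Here is a picture of $U$:

[Figure: the points 0, 1, 2, 3, 4, 5, with arrows from 1 to 3, from 5 to 2, from 2 to 5 and from 4 to 2, and a loop at 3.]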


The next picture illustrates the relation R on the set A = {a,b,c,d}, where 
xRy means x comes before y in the alphabet. According to Definition 11.1, 
as a set this relation is R = {(a,b),(a,c),(a,d),(b,c),(b,d),(c,d)}. You may feel 
that the picture conveys the relation better than the set does. They are two 
different ways of expressing the same thing. In some instances pictures are 
more convenient than sets for discussing relations. 


[Figure: the points a, b, c, d, with an arrow from each letter to every letter that comes after it.]


Although such diagrams can help us visualize relations, they do have 
their limitations. If A and R were infinite, then the diagram would be 
impossible to draw, but the set R might be easily expressed in set-builder 
notation. Here are some examples. 


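Example 11.5 Consider the set $R = \{(x,y) \in \mathbb{Z} \times \mathbb{Z} : x - y \in \mathbb{N}\} \subseteq \mathbb{Z} \times \mathbb{Z}$. This
is the $>$ relation on the set $A = \mathbb{Z}$. It is infinite because there are infinitely
many ways to have $x > y$ where $x$ and $y$ are integers.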


Example 11.6 The set $R = \{(x,x) : x \in \mathbb{R}\} \subseteq \mathbb{R} \times \mathbb{R}$ is the relation $=$ on the
set $\mathbb{R}$, because $xRy$ means the same thing as $x = y$. Thus $R$ is a set that
expresses the notion of equality of real numbers.
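An infinite relation cannot be stored as a finite list of pairs, but set-builder notation has a natural counterpart in code: a function that decides whether a given pair belongs to the set. The sketch below does this for the relations of Examples 11.5 and 11.6. The function names are ours, and we take $\mathbb{N} = \{1, 2, 3, \ldots\}$ as elsewhere in the book.

```python
def in_greater_than(x, y):
    """Membership test for {(x, y) in Z x Z : x - y in N}, as in Example 11.5."""
    both_integers = isinstance(x, int) and isinstance(y, int)
    return both_integers and x - y >= 1  # x - y is a natural number

def in_equality(x, y):
    """Membership test for {(x, x) : x real}, as in Example 11.6."""
    return x == y

print(in_greater_than(5, 2), in_greater_than(2, 5), in_greater_than(2, 2))
print(in_equality(1.5, 1.5), in_equality(1.5, 2.5))
```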




Exercises for Section 11.1 


1. Let $A = \{0,1,2,3,4,5\}$. Write out the relation $R$ that expresses $>$ on $A$. Then
illustrate it with a diagram.

2. Let $A = \{1,2,3,4,5,6\}$. Write out the relation $R$ that expresses $\mid$ (divides) on $A$.
Then illustrate it with a diagram.

3. Let $A = \{0,1,2,3,4,5\}$. Write out the relation $R$ that expresses $\ge$ on $A$. Then
illustrate it with a diagram.

4. Here is a diagram for a relation $R$ on a set $A$. Write the sets $A$ and $R$.

5. Here is a diagram for a relation $R$ on a set $A$. Write the sets $A$ and $R$.

[Figures for Exercises 4 and 5: arrow diagrams; the surviving labels are the points 0, 1, 2, 3, 4, 5.]

6. Congruence modulo 5 is a relation on the set $A = \mathbb{Z}$. In this relation $xRy$ means
$x \equiv y \pmod{5}$. Write out the set $R$ in set-builder notation.

7. Write the relation $<$ on the set $A = \mathbb{Z}$ as a subset $R$ of $\mathbb{Z} \times \mathbb{Z}$. This is an infinite
set, so you will have to use set-builder notation.

8. Let $A = \{1,2,3,4,5,6\}$. Observe that $\emptyset \subseteq A \times A$, so $R = \emptyset$ is a relation on $A$. Draw
a diagram for this relation.

9. Let $A = \{1,2,3,4,5,6\}$. How many different relations are there on the set $A$?

10. Consider the subset $R = (\mathbb{R} \times \mathbb{R}) - \{(x,x) : x \in \mathbb{R}\} \subseteq \mathbb{R} \times \mathbb{R}$. What familiar relation on $\mathbb{R}$
is this? Explain.

11. Given a finite set $A$, how many different relations are there on $A$?

In the following exercises, subsets $R$ of $\mathbb{R}^2 = \mathbb{R} \times \mathbb{R}$ or $\mathbb{Z}^2 = \mathbb{Z} \times \mathbb{Z}$ are indicated by gray
shading. In each case, $R$ is a familiar relation on $\mathbb{R}$ or $\mathbb{Z}$. State it.

[Figures: shaded subsets of the plane.]




11.2 Properties of Relations 


A relational expression $xRy$ is an open sentence; it is either true or false.
For example, $5 < 10$ is true, and $10 < 5$ is false. (Thus an operation like $+$
is not a relation, because, for instance, $5 + 10$ has a numeric value, not a
T/F value.) Since relational expressions have T/F values, we can combine
them with logical operators; for example, $xRy \Rightarrow yRx$ is a statement or open
sentence whose truth or falsity may depend on $x$ and $y$.

With this in mind, note that some relations have properties that others
don’t have. For example, the relation $\le$ on $\mathbb{Z}$ satisfies $x \le x$ for every $x \in \mathbb{Z}$.
But this is not so for $<$ because $x < x$ is never true. The next definition lays
out three particularly significant properties that relations may have.


Definition 11.2 Suppose $R$ is a relation on a set $A$.
1. Relation $R$ is reflexive if $xRx$ for every $x \in A$.
That is, $R$ is reflexive if $\forall x \in A,\ xRx$.
2. Relation $R$ is symmetric if $xRy$ implies $yRx$ for all $x, y \in A$.
That is, $R$ is symmetric if $\forall x, y \in A,\ xRy \Rightarrow yRx$.
3. Relation $R$ is transitive if whenever $xRy$ and $yRz$, then also $xRz$.
That is, $R$ is transitive if $\forall x, y, z \in A,\ \big((xRy) \wedge (yRz)\big) \Rightarrow xRz$.


To illustrate this, let’s consider the set $A = \mathbb{Z}$. Examples of reflexive
relations on $\mathbb{Z}$ include $\le$, $=$, and $\mid$, because $x \le x$, $x = x$ and $x \mid x$ are all true
for any $x \in \mathbb{Z}$. On the other hand, $>$, $<$, $\ne$ and $\nmid$ are not reflexive, for none of
the statements $x < x$, $x > x$, $x \ne x$ and $x \nmid x$ is ever true.

The relation $\ne$ is symmetric, for if $x \ne y$, then surely $y \ne x$ also. Also,
the relation $=$ is symmetric because $x = y$ always implies $y = x$.

The relation $\le$ is not symmetric, as $x \le y$ does not necessarily imply
$y \le x$. For instance $5 \le 6$ is true, but $6 \le 5$ is false. Notice $(x \le y) \Rightarrow (y \le x)$ is
true for some $x$ and $y$ (for example, it is true when $x = 2$ and $y = 2$), but still
$\le$ is not symmetric because it is not the case that $(x \le y) \Rightarrow (y \le x)$ is true
for all integers $x$ and $y$.

The relation $\le$ is transitive because whenever $x \le y$ and $y \le z$, it also
is true that $x \le z$. Likewise $<$, $\ge$, $>$ and $=$ are all transitive. Examine the
following table and be sure you understand why it is labeled as it is.


Relations on Z:    <      ≤      =      |      ∤      ≠

Reflexive          no     yes    yes    yes    no     no
Symmetric          no     no     yes    no     no     yes
Transitive         yes    yes    yes    yes    no     no
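The entries of this table can be spot-checked by brute force on a finite sample of integers. The sketch below does so on the window $-6, \ldots, 6$, which is an arbitrary choice of ours, as are the names `divides` and `properties`. A “no” found on the sample is a genuine counterexample in $\mathbb{Z}$, but a “yes” on the sample is only evidence, not a proof.

```python
from itertools import product
import operator

WINDOW = range(-6, 7)  # a finite sample of Z, chosen only for illustration

def divides(a, b):
    """Return True when a | b, that is, when b = a*k for some integer k."""
    return b == 0 if a == 0 else b % a == 0

RELATIONS = {
    "<": operator.lt,
    "<=": operator.le,
    "=": operator.eq,
    "divides": divides,
    "does not divide": lambda a, b: not divides(a, b),
    "!=": operator.ne,
}

def properties(rel, sample):
    """Return (reflexive, symmetric, transitive) for rel restricted to sample."""
    reflexive = all(rel(x, x) for x in sample)
    symmetric = all(rel(y, x)
                    for x, y in product(sample, repeat=2) if rel(x, y))
    transitive = all(rel(x, z)
                     for x, y, z in product(sample, repeat=3)
                     if rel(x, y) and rel(y, z))
    return reflexive, symmetric, transitive

for name, rel in RELATIONS.items():
    print(name, properties(rel, WINDOW))
```

Each of the three tests is a direct transcription of the corresponding line of Definition 11.2, with the universal quantifier replaced by a loop over the sample.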




Example 11.7 Here $A = \{b,c,d,e\}$, and $R$ is the following relation on $A$:
$R = \{(b,b), (b,c), (c,b), (c,c), (d,d), (b,d), (d,b), (c,d), (d,c)\}$.

This relation is not reflexive, for although $bRb$, $cRc$ and $dRd$, it is not true
that $eRe$. For a relation to be reflexive, $xRx$ must be true for all $x \in A$.

The relation $R$ is symmetric, because whenever we have $xRy$, it follows
that $yRx$ too. Observe that $bRc$ and $cRb$; $bRd$ and $dRb$; $dRc$ and $cRd$. Take
away the ordered pair $(c,b)$ from $R$, and $R$ is no longer symmetric.

The relation $R$ is transitive, but it takes some work to check it. We
must check that the statement $(xRy \wedge yRz) \Rightarrow xRz$ is true for all $x, y, z \in A$.
For example, taking $x = b$, $y = c$ and $z = d$, we have $(bRc \wedge cRd) \Rightarrow bRd$,
which is the true statement $(T \wedge T) \Rightarrow T$. Likewise, $(bRd \wedge dRc) \Rightarrow bRc$ is
the true statement $(T \wedge T) \Rightarrow T$. Take note that if $x = b$, $y = e$ and $z = c$,
then $(bRe \wedge eRc) \Rightarrow bRc$ becomes $(F \wedge F) \Rightarrow T$, which is still true. It’s not
much fun, but going through all the combinations, you can verify that
$(xRy \wedge yRz) \Rightarrow xRz$ is true for all choices $x, y, z \in A$. (Try at least a few of
them.)
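Going through all the combinations by hand is tedious, but it is exactly the kind of work a short program does well. The sketch below stores the relation of Example 11.7 as a set of pairs and checks the three properties of Definition 11.2 by looping over every choice of $x$, $y$ and $z$. The function names are ours.

```python
from itertools import product

A = {"b", "c", "d", "e"}
R = {("b", "b"), ("b", "c"), ("c", "b"), ("c", "c"), ("d", "d"),
     ("b", "d"), ("d", "b"), ("c", "d"), ("d", "c")}

def is_reflexive(rel, elements):
    """x R x for every x in the set."""
    return all((x, x) in rel for x in elements)

def is_symmetric(rel, elements):
    """x R y implies y R x for all x, y in the set."""
    return all((y, x) in rel
               for x, y in product(elements, repeat=2) if (x, y) in rel)

def is_transitive(rel, elements):
    """(x R y and y R z) implies x R z for all x, y, z in the set."""
    return all((x, z) in rel
               for x, y, z in product(elements, repeat=3)
               if (x, y) in rel and (y, z) in rel)

print(is_reflexive(R, A), is_symmetric(R, A), is_transitive(R, A))

# Removing the single pair (c, b) destroys symmetry.
print(is_symmetric(R - {("c", "b")}, A))
```

Triples whose hypothesis is false, such as $x = b$, $y = e$, $z = c$, are skipped by the `if` clause, which matches the observation that an implication with a false hypothesis is automatically true.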

The relation R from Example 11.7 has a meaning. You can think of 
xRy as meaning that x and y are both consonants. Thus bRc because b 
and $c$ are both consonants; but $b \not R e$ because it’s not true that $b$ and $e$ are
both consonants. Once we look at it this way, it’s immediately clear that R 
has to be transitive. If x and y are both consonants and y and z are both 
consonants, then surely x and z are both consonants. This illustrates a 
point that we will see again later in this section: Knowing the meaning of a 
relation can help us understand it and prove things about it. 

Here is a picture of R. Notice that we can immediately spot several 
properties of R that may not have been so clear from its set description. For 
instance, we see that $R$ is not reflexive because it lacks a loop at $e$, hence
$e \not R e$.


[Figure 11.1. The relation R from Example 11.7]




In what follows, we summarize how to spot the various properties of a 
relation from its diagram. Compare these with Figure 11.1. 


1. A relation is reflexive if for each point $x$ there is a loop at $x$.

2. A relation is symmetric if whenever there is an arrow from $x$ to $y$ there is also
an arrow from $y$ back to $x$.

3. A relation is transitive if whenever there are arrows from $x$ to $y$ and $y$ to $z$
there is also an arrow from $x$ to $z$. (If $x = z$, this means that if there are arrows
from $x$ to $y$ and from $y$ to $x$, there is also a loop from $x$ back to $x$.)


Consider the special case $x = z$ in Box 3, above. The transitive property
demands $(xRy \wedge yRx) \Rightarrow xRx$. Thus, if $xRy$ and $yRx$ in a transitive relation,
then also $xRx$, so there is a loop at $x$. In this case $(yRx \wedge xRy) \Rightarrow yRy$, so
there will be a loop at $y$ too.


Although these visual aids can be illuminating, their use is limited
because many relations are too large and complex to be adequately described
as diagrams. For example, it would be impossible to draw a diagram for the
relation $\equiv \pmod{n}$, where $n \in \mathbb{N}$. Such a relation would best be explained in
a more theoretical (and less visual) way.

We next prove that $\equiv \pmod{n}$ is reflexive, symmetric and transitive.
Obviously we will not glean this from a drawing. Instead we will prove it
from the properties of $\equiv \pmod{n}$ and Definition 11.2. Pay attention to this
example. It illustrates how to prove things about relations.




Example 11.8 Prove the following proposition.


Proposition Let $n \in \mathbb{N}$. The relation $\equiv \pmod{n}$ on the set $\mathbb{Z}$ is reflexive,
symmetric and transitive.


Proof. First we will show that $\equiv \pmod{n}$ is reflexive. Take any integer $x \in \mathbb{Z}$,
and observe that $n \mid 0$, so $n \mid (x - x)$. By definition of congruence modulo $n$, we
have $x \equiv x \pmod{n}$. This shows $x \equiv x \pmod{n}$ for every $x \in \mathbb{Z}$, so $\equiv \pmod{n}$ is
reflexive.

Next, we will show that $\equiv \pmod{n}$ is symmetric. For this, we must show
that for all $x, y \in \mathbb{Z}$, the condition $x \equiv y \pmod{n}$ implies that $y \equiv x \pmod{n}$.
We use direct proof. Suppose $x \equiv y \pmod{n}$. Thus $n \mid (x - y)$ by definition
of congruence modulo $n$. Then $x - y = na$ for some $a \in \mathbb{Z}$ by definition of
divisibility. Multiplying both sides by $-1$ gives $y - x = n(-a)$. Therefore
$n \mid (y - x)$, and this means $y \equiv x \pmod{n}$. We’ve shown that $x \equiv y \pmod{n}$
implies that $y \equiv x \pmod{n}$, and this means $\equiv \pmod{n}$ is symmetric.

Finally we will show that $\equiv \pmod{n}$ is transitive. For this we must
show that if $x \equiv y \pmod{n}$ and $y \equiv z \pmod{n}$, then $x \equiv z \pmod{n}$. Again
we use direct proof. Suppose $x \equiv y \pmod{n}$ and $y \equiv z \pmod{n}$. This means
$n \mid (x - y)$ and $n \mid (y - z)$. Therefore there are integers $a$ and $b$ for which
$x - y = na$ and $y - z = nb$. Adding these two equations, we obtain $x - z = na + nb$.
Consequently, $x - z = n(a + b)$, so $n \mid (x - z)$, hence $x \equiv z \pmod{n}$. This completes
the proof that $\equiv \pmod{n}$ is transitive.

The past three paragraphs have shown that the relation $\equiv \pmod{n}$ is
reflexive, symmetric and transitive, so the proof is complete. ∎


As you continue with mathematics, the reflexive, symmetric and transitive
properties will take on special significance in a variety of settings.
In preparation for this, the next section explores further consequences of 
these properties. But first work some of the following exercises. 


Exercises for Section 11.2 


1. Consider the relation R = {(a,a),(b,b),(c,c),(d,d),(a, b),(b,a)} on set A = {a,b,c,d}. 
Is R reflexive? Symmetric? Transitive? If a property does not hold, say why. 

2. Consider the relation R = {(a,b),(a,c),(c,c),(b, b),(c, b),(b, c)} on the set A = {a,b,c}. 
Is R reflexive? Symmetric? Transitive? If a property does not hold, say why. 

3. Consider the relation R = {(a,b),(a,c),(c,b),(b,c)} on the set A = {a,b,c}. Is R 
reflexive? Symmetric? Transitive? If a property does not hold, say why. 




4. Let $A = \{a,b,c,d\}$. Suppose $R$ is the relation
$R = \{(a,a),(b,b),(c,c),(d,d),(a,b),(b,a),(a,c),(c,a),$
$(a,d),(d,a),(b,c),(c,b),(b,d),(d,b),(c,d),(d,c)\}$.
Is $R$ reflexive? Symmetric? Transitive? If a property does not hold, say why.

5. Consider the relation $R = \{(0,0),(\sqrt{2},0),(0,\sqrt{2}),(\sqrt{2},\sqrt{2})\}$ on $\mathbb{R}$. Is $R$ reflexive?
Symmetric? Transitive? If a property does not hold, say why.

6. Consider the relation $R = \{(x,x) : x \in \mathbb{Z}\}$ on $\mathbb{Z}$. Is this $R$ reflexive? Symmetric?
Transitive? If a property does not hold, say why. What familiar relation is this?

7. There are 16 possible different relations $R$ on the set $A = \{a,b\}$. Describe all of
them. (A picture for each one will suffice, but don’t forget to label the nodes.)
Which ones are reflexive? Symmetric? Transitive?

8. Define a relation on $\mathbb{Z}$ as $xRy$ if $|x - y| < 1$. Is $R$ reflexive? Symmetric? Transitive?
If a property does not hold, say why. What familiar relation is this?

9. Define a relation on $\mathbb{Z}$ by declaring $xRy$ if and only if $x$ and $y$ have the same
parity. Is $R$ reflexive? Symmetric? Transitive? If a property does not hold, say
why. What familiar relation is this?

10. Suppose $A \ne \emptyset$. Since $\emptyset \subseteq A \times A$, the set $R = \emptyset$ is a relation on $A$. Is $R$ reflexive?
Symmetric? Transitive? If a property does not hold, say why.

11. Let $A = \{a,b,c,d\}$ and $R = \{(a,a),(b,b),(c,c),(d,d)\}$. Is $R$ reflexive? Symmetric?
Transitive? If a property does not hold, say why.

12. Prove that the relation $\mid$ (divides) on the set $\mathbb{Z}$ is reflexive and transitive. (Use
Example 11.8 as a guide if you are unsure of how to proceed.)

13. Consider the relation $R = \{(x,y) \in \mathbb{R} \times \mathbb{R} : x - y \in \mathbb{Z}\}$ on $\mathbb{R}$. Prove that this relation is
reflexive, symmetric and transitive.

14. Suppose $R$ is a symmetric and transitive relation on a set $A$, and there is an
element $a \in A$ for which $aRx$ for every $x \in A$. Prove that $R$ is reflexive.

15. Prove or disprove: If a relation is symmetric and transitive, then it is also
reflexive.

16. Define a relation $R$ on $\mathbb{Z}$ by declaring that $xRy$ if and only if $x^2 \equiv y^2 \pmod{4}$.
Prove that $R$ is reflexive, symmetric and transitive.

17. Modifying Exercise 8 (above) slightly, define a relation $\sim$ on $\mathbb{Z}$ as $x \sim y$ if and only
if $|x - y| \le 1$. Say whether $\sim$ is reflexive. Is it symmetric? Transitive?

18. The table on page 205 shows that relations on $\mathbb{Z}$ may obey various combinations of
the reflexive, symmetric and transitive properties. In all, there are $2^3 = 8$ possible
combinations, and the table shows 5 of them. (There is some redundancy, as $\le$
and $\mid$ have the same type.) Complete the table by finding examples of relations
on $\mathbb{Z}$ for the three missing combinations.




11.3 Equivalence Relations


The relation = on the set Z (or on any set A) is reflexive, symmetric and 
transitive. There are many other relations that are also reflexive, symmetric 
and transitive. Relations that have all three of these properties occur very 
frequently in mathematics and often play quite significant roles. (For 
instance, this is certainly true of the relation =.) Such relations are given 
a special name. They are called equivalence relations. 


Definition 11.3 A relation $R$ on a set $A$ is an equivalence relation if
it is reflexive, symmetric and transitive.


As an example, Figure 11.2 shows four different equivalence relations $R_1$,
$R_2$, $R_3$ and $R_4$ on the set $A = \{-1,1,2,3,4\}$. Each one has its own meaning,
as labeled. For example, in the second row the relation $R_2$ literally means
“has the same parity as.” So $1R_23$ means “1 has the same parity as 3,” etc.


Relation $R$, with its equivalence classes (see Definition 11.4)

“is equal to” ($=$)
$R_1 = \{(-1,-1),(1,1),(2,2),(3,3),(4,4)\}$
Equivalence classes: $\{-1\}, \{1\}, \{2\}, \{3\}, \{4\}$

“has same parity as”
$R_2 = \{(-1,-1),(1,1),(2,2),(3,3),(4,4),$
$(-1,1),(1,-1),(-1,3),(3,-1),$
$(1,3),(3,1),(2,4),(4,2)\}$
Equivalence classes: $\{-1,1,3\}, \{2,4\}$

“has same sign as”
$R_3 = \{(-1,-1),(1,1),(2,2),(3,3),(4,4),$
$(1,2),(2,1),(1,3),(3,1),(1,4),(4,1),$
$(3,4),(4,3),(2,3),(3,2),(2,4),(4,2)\}$
Equivalence classes: $\{-1\}, \{1,2,3,4\}$

“has same parity and sign as”
$R_4 = \{(-1,-1),(1,1),(2,2),(3,3),(4,4),$
$(1,3),(3,1),(2,4),(4,2)\}$
Equivalence classes: $\{-1\}, \{1,3\}, \{2,4\}$


Figure 11.2. Examples of equivalence relations on the set $A = \{-1,1,2,3,4\}$




The above diagrams make it easy to check that each relation is reflexive,
symmetric and transitive, i.e., that each is an equivalence relation. For
example, $R_1$ is symmetric because $xR_1y \Rightarrow yR_1x$ is always true: When $x = y$
it becomes $T \Rightarrow T$ (true), and when $x \ne y$ it becomes $F \Rightarrow F$ (also true). In
a similar fashion, $R_1$ is transitive because $(xR_1y \wedge yR_1z) \Rightarrow xR_1z$ is always
true: It always works out to one of $T \Rightarrow T$, $F \Rightarrow T$ or $F \Rightarrow F$. (Check this.)

As you can see from the examples in Figure 11.2, equivalence relations 
on a set tend to express some measure of “sameness” among the elements 
of the set, whether it is true equality or something weaker (like having the 
same parity). 

It’s time to introduce an important definition. Whenever you have an 
equivalence relation $R$ on a set $A$, it divides $A$ into subsets called equivalence
classes. Here is the definition:


Definition 11.4 Suppose $R$ is an equivalence relation on a set $A$. Given
any element $a \in A$, the equivalence class containing $a$ is the subset
$\{x \in A : xRa\}$ of $A$ consisting of all the elements of $A$ that relate to $a$. This
set is denoted as $[a]$. Thus the equivalence class containing $a$ is the set
$[a] = \{x \in A : xRa\}$.


Example 11.9 Consider the relation $R_1$ in Figure 11.2. The equivalence
class containing 2 is the set $[2] = \{x \in A : xR_12\}$. Because in this relation the
only element that relates to 2 is 2 itself, we have $[2] = \{2\}$. Other equivalence
classes for $R_1$ are $[-1] = \{-1\}$, $[1] = \{1\}$, $[3] = \{3\}$ and $[4] = \{4\}$. Thus this
relation has five separate equivalence classes.


Example 11.10 Consider the relation $R_2$ in Figure 11.2. The equivalence
class containing 2 is the set $[2] = \{x \in A : xR_22\}$. Because only 2 and 4 relate
to 2, we have $[2] = \{2,4\}$. Observe that we also have $[4] = \{x \in A : xR_24\} = \{2,4\}$,
so $[2] = [4]$. Another equivalence class for $R_2$ is $[1] = \{x \in A : xR_21\} = \{-1,1,3\}$.
In addition, note that $[1] = [-1] = [3] = \{-1,1,3\}$. Thus this relation has just
two equivalence classes, namely $\{2,4\}$ and $\{-1,1,3\}$.


Example 11.11 The relation $R_4$ in Figure 11.2 has three equivalence
classes. They are $[-1] = \{-1\}$ and $[1] = [3] = \{1,3\}$ and $[2] = [4] = \{2,4\}$.
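For a finite set, Definition 11.4 can be carried out mechanically. The following sketch rebuilds the four relations of Figure 11.2 from their verbal descriptions and computes their equivalence classes. The helper names (`relation_from`, `equivalence_class`, `equivalence_classes`) are ours, and the lambda expressions are our own encodings of “same parity” and “same sign.”

```python
A = [-1, 1, 2, 3, 4]

def relation_from(same):
    """Return the set of pairs (x, y) in A x A for which same(x, y) is true."""
    return {(x, y) for x in A for y in A if same(x, y)}

def equivalence_class(rel, a):
    """Return [a] = {x in A : x R a} as a frozenset."""
    return frozenset(x for x in A if (x, a) in rel)

def equivalence_classes(rel):
    """Return the set of distinct classes [a] as a runs through A."""
    return {equivalence_class(rel, a) for a in A}

R1 = relation_from(lambda x, y: x == y)               # "is equal to"
R2 = relation_from(lambda x, y: (x - y) % 2 == 0)     # "has same parity as"
R3 = relation_from(lambda x, y: (x > 0) == (y > 0))   # "has same sign as"
R4 = R2 & R3                                          # same parity and sign

for name, rel in [("R1", R1), ("R2", R2), ("R3", R3), ("R4", R4)]:
    print(name, sorted(sorted(c) for c in equivalence_classes(rel)))
```

Collecting the classes into a Python set automatically merges duplicates such as $[2]$ and $[4]$, so the number of elements in the result is the number of distinct equivalence classes.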


Don’t be misled by Figure 11.2. It’s important to realize that not every 
equivalence relation can be drawn as a diagram involving nodes and arrows. 
Even the simple relation $R = \{(x,x) : x \in \mathbb{R}\}$, which expresses equality in the
set R, is too big to be drawn. Its picture would involve a point for every real 
number and a loop at each point. Clearly that’s too many points and loops 
to draw. 




We close this section with several other examples of equivalence relations 
on infinite sets. 


Example 11.12 Let $P$ be the set of all polynomials with real coefficients.
Define a relation $R$ on $P$ as follows. Given $f(x), g(x) \in P$, let $f(x)\,R\,g(x)$ mean
that $f(x)$ and $g(x)$ have the same degree. Thus $(x^2 + 3x - 4)\,R\,(3x^2 - 2)$ and
$(x^3 + 3x^2 - 4) \not R\, (3x^2 - 2)$, for example. It takes just a quick mental check to
see that $R$ is an equivalence relation. (Do it.) It’s easy to describe the
equivalence classes of $R$. For example, $[3x^2 + 2]$ is the set of all polynomials
that have the same degree as $3x^2 + 2$, that is, the set of all polynomials of
degree 2. We can write this as $[3x^2 + 2] = \{ax^2 + bx + c : a, b, c \in \mathbb{R}, a \ne 0\}$.


Example 11.13 In Example 11.8 we proved that for a given $n \in \mathbb{N}$ the
relation $\equiv \pmod{n}$ is reflexive, symmetric and transitive. Thus, in our new
parlance, $\equiv \pmod{n}$ is an equivalence relation on $\mathbb{Z}$. Consider the case $n = 3$.
Let’s find the equivalence classes of the equivalence relation $\equiv \pmod{3}$.
The equivalence class containing 0 seems like a reasonable place to start.
Observe that

$[0] = \{x \in \mathbb{Z} : x \equiv 0 \pmod{3}\} = \{x \in \mathbb{Z} : 3 \mid (x - 0)\} = \{x \in \mathbb{Z} : 3 \mid x\} = \{\ldots, -3, 0, 3, 6, 9, \ldots\}$.

Thus the class $[0]$ consists of all the multiples of 3. (Or, said differently, $[0]$
consists of all integers that have a remainder of 0 when divided by 3.) Note
that $[0] = [3] = [6] = [9]$, etc. The number 1 does not show up in the set $[0]$ so
let’s next look at the equivalence class $[1]$:

$[1] = \{x \in \mathbb{Z} : x \equiv 1 \pmod{3}\} = \{x \in \mathbb{Z} : 3 \mid (x - 1)\} = \{\ldots, -5, -2, 1, 4, 7, 10, \ldots\}$.

The equivalence class $[1]$ consists of all integers that give a remainder of 1
when divided by 3. The number 2 is in neither of the sets $[0]$ or $[1]$, so we
next look at the equivalence class $[2]$:

$[2] = \{x \in \mathbb{Z} : x \equiv 2 \pmod{3}\} = \{x \in \mathbb{Z} : 3 \mid (x - 2)\} = \{\ldots, -4, -1, 2, 5, 8, 11, \ldots\}$.

The equivalence class $[2]$ consists of all integers that give a remainder of
2 when divided by 3. Observe that any integer is in one of the sets $[0]$, $[1]$
or $[2]$, so we have listed all of the equivalence classes. Thus $\equiv \pmod{3}$ has
exactly three equivalence classes, as described above.

Similarly, you can show that the equivalence relation $\equiv \pmod{n}$ has $n$
equivalence classes $[0], [1], [2], \ldots, [n-1]$.
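The classes are infinite, so a program can only display the part of each class that falls in some finite window. The sketch below does this for congruence modulo 3. The names `congruent` and `class_of` and the window $-6, \ldots, 11$ are our own choices for illustration.

```python
def congruent(x, y, n):
    """Return True when x is congruent to y modulo n, i.e. n divides x - y."""
    return (x - y) % n == 0

def class_of(a, n, window):
    """Return the elements of the class [a] that lie in the given finite window."""
    return [x for x in window if congruent(x, a, n)]

window = range(-6, 12)
for a in range(3):
    print(a, class_of(a, 3, window))

# Different representatives can name the same class, for example [0] = [3].
print(class_of(0, 3, window) == class_of(3, 3, window))
```

Every integer in the window lands in exactly one of the three lists, which mirrors the observation that $[0]$, $[1]$ and $[2]$ account for all of $\mathbb{Z}$.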




The idea of an equivalence relation is fundamental. In a very real sense 
you have dealt with equivalence relations for much of your life, without 
being aware of it. In fact your conception of fractions is entwined with an 
intuitive notion of an equivalence relation. To see how this is so, consider 
the set of all fractions, not necessarily reduced: 


F = {m/n : m, n ∈ ℤ, n ≠ 0}.


Interpret this set not as ℚ, but rather as the set of all possible fractions.
For example, we consider the fractions 1/2 and 2/4 as being distinct (unequal)
elements of F because their numerators and denominators don’t match.
Of course 1/2 and 2/4 are equal numbers, but they are different fractions, so
1/2, 2/4 ∈ F, but 1/2 ≠ 2/4 (meaning they are distinct, unequal elements of F).

Define a relation ≐ on F by saying a/b ≐ c/d provided that ad = bc. Thus
1/2 ≐ 2/4 because 1·4 = 2·2. Similarly, notice that −15/10 ≐ −3/2 because −15·2 = −3·10.
We have defined ≐ so that a/b ≐ c/d if and only if a/b and c/d are equal numbers,
so ≐ models your intuitive, ingrained understanding of when two different
fractions are equal.

Observe that ≐ is an equivalence relation on the set F of all fractions: It is
reflexive because for any a/b ∈ F the equation ab = ba guarantees a/b ≐ a/b. To see
that ≐ is symmetric, suppose a/b ≐ c/d. This means ad = bc, so cb = da, which
implies c/d ≐ a/b. Exercise 16 below asks you to confirm that ≐ is transitive.

This discussion shows that your everyday understanding of equality of 
fractions is an equivalence relation. The equivalence class containing, say, 
2/3 is the set {2n/3n : n ∈ ℤ, n ≠ 0} of all fractions that are numerically equal to 2/3.
The takeaway is that you have for years lumped together equal fractions 
into equivalence classes under this equivalence relation. 
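
To experiment with this relation on fractions, here is a small Python sketch. It is an illustration only: a fraction a/b is stored as the pair (a, b), the name related is hypothetical, and the set F below is a finite window of fractions chosen for convenience rather than the full infinite set. A finite check like this is evidence, not a proof.

```python
from itertools import product

def related(f, g):
    """(a, b) is related to (c, d) exactly when ad = bc."""
    (a, b), (c, d) = f, g
    return a * d == b * c

# Finite window of fractions m/n with nonzero denominator (an illustrative choice).
F = [(m, n) for m, n in product(range(-4, 5), repeat=2) if n != 0]

reflexive = all(related(f, f) for f in F)
symmetric = all(related(g, f) for f in F for g in F if related(f, g))
transitive = all(
    related(f, h)
    for f in F for g in F for h in F
    if related(f, g) and related(g, h)
)

# The part of the class of 1/2 that is visible inside the window.
class_of_half = [g for g in F if related((1, 2), g)]
print(reflexive, symmetric, transitive)
print(class_of_half)
```

The list class_of_half collects the fractions in the window that equal 1/2 numerically, such as (2, 4) and (−1, −2).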

Later, when you learned calculus, equivalence relations once again were 
just under the surface. The antiderivative ∫ f(x) dx of a function f(x) was
defined to be the set of functions F(x)+C whose derivatives are f(x). This 
set is an equivalence class in the set of integrable functions, where two 
functions are related if their difference is a constant. (We are glossing over 
some fine points that will be cleared up in an advanced calculus course.) 

Such examples underscore an important point: Equivalence relations 
arise in many areas of mathematics. This is especially true in the advanced 
realms of mathematics, where equivalence relations are the right tool 
for important constructions, constructions as natural and far-reaching as 
fractions, or antiderivatives. Learning about equivalence relations now 
paves the way to a deeper understanding of later courses, and work. 




Exercises for Section 11.3 


1. Let A = {1,2,3,4,5,6}, and consider the following equivalence relation on A:
R = {(1,1), (2,2), (3,3), (4,4), (5,5), (6,6), (2,3), (3,2), (4,5), (5,4), (4,6), (6,4), (5,6), (6,5)}.
List the equivalence classes of R.

2. Let A = {a,b,c,d,e}. Suppose R is an equivalence relation on A. Suppose R has
two equivalence classes. Also aRd, bRc and eRd. Write out R as a set.

3. Let A = {a,b,c,d,e}. Suppose R is an equivalence relation on A. Suppose R has
three equivalence classes. Also aRd and bRc. Write out R as a set.

4. Let A = {a,b,c,d,e}. Suppose R is an equivalence relation on A. Suppose also
that aRd and bRc, eRa and cRe. How many equivalence classes does R have?

5. There are two different equivalence relations on the set A = {a,b}. Describe them.
Diagrams will suffice.

6. There are five different equivalence relations on the set A = {a,b,c}. Describe
them all. Diagrams will suffice.

7. Define a relation R on ℤ as xRy if and only if 3x − 5y is even. Prove R is an
equivalence relation. Describe its equivalence classes.

8. Define a relation R on ℤ as xRy if and only if x² + y² is even. Prove R is an
equivalence relation. Describe its equivalence classes.

9. Define a relation R on ℤ as xRy if and only if 4 | (x + 3y). Prove R is an equivalence
relation. Describe its equivalence classes.

10. Suppose R and S are two equivalence relations on a set A. Prove that R ∩ S
is also an equivalence relation. (For an example of this, look at Figure 11.2.
Observe that for the equivalence relations R₂, R₃ and R₄, we have R₂ ∩ R₃ = R₄.)

11. Prove or disprove: If R is an equivalence relation on an infinite set A, then R
has infinitely many equivalence classes.

12. Prove or disprove: If R and S are two equivalence relations on a set A, then R ∪ S
is also an equivalence relation on A.

13. Suppose R is an equivalence relation on a finite set A, and every equivalence
class has the same cardinality m. Express |R| in terms of m and |A|.

14. Suppose R is a reflexive and symmetric relation on a finite set A. Define a
relation S on A by declaring xSy if and only if for some n ∈ ℕ there are
elements x₁, x₂, ..., xₙ ∈ A satisfying xRx₁, x₁Rx₂, x₂Rx₃, x₃Rx₄, ..., xₙ₋₁Rxₙ, and
xₙRy. Show that S is an equivalence relation and R ⊆ S. Prove that S is the
unique smallest equivalence relation on A containing R.

15. Suppose R is an equivalence relation on a set A, with four equivalence classes.
How many different equivalence relations S on A are there for which R ⊆ S?

16. Show that the relation ≐ on the set F of all fractions, defined earlier in this
section by declaring a/b ≐ c/d provided that ad = bc, is transitive.




11.4 Equivalence Classes and Partitions 


This section collects several properties of equivalence classes. 

Our first result proves that [a] = [b] if and only if aRb. This is useful
because it assures us that whenever we are in a situation where [a] = [b], we 
also have aRb, and vice versa. Being able to switch back and forth between 
these two pieces of information can be helpful in a variety of situations, 
and you may find yourself using this result a lot. Be sure to notice that 
the proof uses all three properties (reflexive, symmetric and transitive) of 
equivalence relations. Notice also that we have to use some techniques from 
Chapter 8 (Proofs Involving Sets) in dealing with the sets [a] and [b].


Theorem 11.1 Suppose R is an equivalence relation on a set A. Suppose
also that a, b ∈ A. Then [a] = [b] if and only if aRb.


Proof. Suppose [a] = [b]. Note that aRa by the reflexive property of R, so
a ∈ {x ∈ A : xRa} = [a] = [b] = {x ∈ A : xRb}. But a belonging to {x ∈ A : xRb}
means aRb. This completes the first part of the if-and-only-if proof.

Conversely, suppose aRb. We need to show [a] = [b]. We will do this by
showing [a] ⊆ [b] and [b] ⊆ [a].

First we show [a] ⊆ [b]. Suppose c ∈ [a]. As c ∈ [a] = {x ∈ A : xRa}, we get
cRa. Now we have cRa and aRb, so cRb because R is transitive. But cRb
implies c ∈ {x ∈ A : xRb} = [b]. This demonstrates that c ∈ [a] implies c ∈ [b],
so [a] ⊆ [b].

Next we show [b] ⊆ [a]. Suppose c ∈ [b]. As c ∈ [b] = {x ∈ A : xRb}, we get
cRb. Remember that we are assuming aRb, so bRa because R is symmetric.
Now we have cRb and bRa, so cRa because R is transitive. But cRa implies
c ∈ {x ∈ A : xRa} = [a]. This demonstrates that c ∈ [b] implies c ∈ [a]; hence
[b] ⊆ [a].

The previous two paragraphs imply that [a] = [b]. ∎


To illustrate Theorem 11.1, recall that in Example 11.13 we worked out 
the equivalence classes of ≡ (mod 3). We observed that


[−3] = [9] = {..., −3, 0, 3, 6, 9, ...}.


Note that [−3] = [9] and −3 ≡ 9 (mod 3), just as Theorem 11.1 predicts. The
theorem assures us that this will work for any equivalence relation. In 
the future you may find yourself using the result of Theorem 11.1 often. 
Over time it may become natural and familiar; you will use it automatically, 
without even thinking of it as a theorem. 
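
For a finite set, Theorem 11.1 can be checked exhaustively. The sketch below is only an illustration under stated assumptions: the set A and the "same parity" relation R are sample choices, and eq_class is a hypothetical helper name. Passing this check for one relation does not replace the proof above.

```python
A = {-1, 1, 2, 3, 4}

# Sample equivalence relation (an assumption for this sketch):
# x R y exactly when x and y have the same parity.
R = {(x, y) for x in A for y in A if (x - y) % 2 == 0}

def eq_class(a, R, A):
    """The class [a] = {x in A : x R a}, as a frozenset."""
    return frozenset(x for x in A if (x, a) in R)

# Theorem 11.1: [a] = [b] if and only if a R b, for every pair a, b in A.
theorem_holds = all(
    (eq_class(a, R, A) == eq_class(b, R, A)) == ((a, b) in R)
    for a in A for b in A
)
print(theorem_holds)
```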




Our next topic addresses the fact that an equivalence relation on a set 
A divides A into various equivalence classes. There is a special word for 
this kind of situation. We address it now, as you are likely to encounter it 
in subsequent mathematics classes. 


Definition 11.5 A partition of a set A is a set of non-empty subsets
of A, such that the union of all the subsets equals A, and the intersection
of any two different subsets is ∅.


Example 11.14 Let A = {a,b,c,d}. One partition of A is {{a,b}, {c}, {d}}.
This is a set of three subsets {a,b}, {c} and {d} of A. The union of the three
subsets equals A; the intersection of any two subsets is ∅.

Other partitions of A are


{{a,b}, {c,d}}, {{a,c}, {b}, {d}}, {{a}, {b}, {c}, {d}}, {{a,b,c,d}},


to name a few. Intuitively, a partition is just a dividing up of A into pieces. 


Example 11.15 Consider the equivalence relations in Figure 11.2. Each
of these is a relation on the set A = {−1,1,2,3,4}. The equivalence classes of
each relation are listed on the right side of the figure. Observe that, in each
case, the set of equivalence classes forms a partition of A. For example, the
relation R₁ yields the partition {{−1}, {1}, {2}, {3}, {4}} of A. Likewise the
equivalence classes of R₂ form the partition {{−1,1,3}, {2,4}}.


Example 11.16 Recall that Example 11.13 worked out the equivalence
classes of the equivalence relation ≡ (mod 3) on the set ℤ. These equivalence
classes give the following partition of ℤ:


{{..., −3, 0, 3, 6, 9, ...}, {..., −2, 1, 4, 7, 10, ...}, {..., −1, 2, 5, 8, 11, ...}}.


We can write it more compactly as {[0], [1], [2]}.


Our examples and experience suggest that the equivalence classes of an 
equivalence relation on a set form a partition of that set. This is indeed the 
case, and we now prove it. 


Theorem 11.2 Suppose R is an equivalence relation on a set A. Then the
set {[a] : a ∈ A} of equivalence classes of R forms a partition of A.


Proof. To show that {[a] : a ∈ A} is a partition of A we need to show two
things: We need to show that the union of all the sets [a] equals A, and we
need to show that if [a] ≠ [b], then [a] ∩ [b] = ∅.

Notationally, the union of all the sets [a] is ⋃_{a∈A} [a], so we need to prove
⋃_{a∈A} [a] = A. Suppose x ∈ ⋃_{a∈A} [a]. This means x ∈ [a] for some a ∈ A. Since
[a] ⊆ A, it then follows that x ∈ A. Thus ⋃_{a∈A} [a] ⊆ A. On the other hand,
suppose x ∈ A. As x ∈ [x], we know x ∈ [a] for some a ∈ A (namely a = x).
Therefore x ∈ ⋃_{a∈A} [a], and this shows A ⊆ ⋃_{a∈A} [a]. Since ⋃_{a∈A} [a] ⊆ A and
A ⊆ ⋃_{a∈A} [a], it follows that ⋃_{a∈A} [a] = A.

Next we need to show that if [a] ≠ [b] then [a] ∩ [b] = ∅. Let’s use
contrapositive proof. Suppose it’s not the case that [a] ∩ [b] = ∅, so there is
some element c with c ∈ [a] ∩ [b]. Thus c ∈ [a] and c ∈ [b]. Now, c ∈ [a] means
cRa, and then aRc since R is symmetric. Also c ∈ [b] means cRb. Now we
have aRc and cRb, so aRb (because R is transitive). By Theorem 11.1, aRb
implies [a] = [b]. Thus [a] ≠ [b] is not true.

We’ve now shown that the union of all the equivalence classes is A, and
the intersection of two different equivalence classes is ∅. Therefore the set
of equivalence classes is a partition of A. ∎


Theorem 11.2 says the equivalence classes of any equivalence relation 
on a set A form a partition of A. Conversely, any partition of A describes 
an equivalence relation R where xRy if and only if x and y belong to the 
same set in the partition. (See Exercise 4 for this section, below.) Thus 
equivalence relations and partitions are really just two different ways of 
looking at the same thing. In your future mathematical studies you may 
find yourself easily switching between these two points of view. 
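
The passage between the two points of view can be sketched in Python for finite sets. Everything here is an illustrative assumption rather than the book's construction: the helper names classes_of, relation_of and is_partition are hypothetical, and blocks of a partition are stored as frozensets so that they can be elements of a set.

```python
def classes_of(R, A):
    """The set of equivalence classes {[a] : a in A} of a relation R on A."""
    return {frozenset(x for x in A if (x, a) in R) for a in A}

def relation_of(P):
    """The relation in which x R y exactly when x and y share a block of P."""
    return {(x, y) for X in P for x in X for y in X}

def is_partition(P, A):
    """Check Definition 11.5: non-empty blocks, union A, pairwise disjoint."""
    blocks = list(P)
    nonempty = all(blocks)  # an empty frozenset is falsy
    covers = set().union(*blocks) == set(A)
    disjoint = all(
        not (X & Y)
        for i, X in enumerate(blocks)
        for Y in blocks[i + 1:]
    )
    return nonempty and covers and disjoint

A = {"a", "b", "c", "d"}
P = {frozenset("ab"), frozenset("c"), frozenset("d")}

R = relation_of(P)           # partition -> equivalence relation
round_trip = classes_of(R, A)  # equivalence relation -> partition
print(is_partition(P, A), round_trip == P)
```

Starting from the partition {{a,b}, {c}, {d}} of Example 11.14, the round trip returns the same partition, in line with Theorem 11.2 and the remark that follows it.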


Exercises for Section 11.4 


1. List all the partitions of the set A = {a,b}. Compare your answer to the answer 
to Exercise 5 of Section 11.3. 


2. List all the partitions of the set A = {a,b,c}. Compare your answer to the answer 
to Exercise 6 of Section 11.3. 


3. Describe the partition of ℤ resulting from the equivalence relation ≡ (mod 4).

4. Suppose P is a partition of a set A. Define a relation R on A by declaring xRy
if and only if x, y ∈ X for some X ∈ P. Prove R is an equivalence relation on A.
Then prove that P is the set of equivalence classes of R.

5. Consider the partition P = {{..., −4, −2, 0, 2, 4, ...}, {..., −5, −3, −1, 1, 3, 5, ...}} of ℤ.
Let R be the equivalence relation whose equivalence classes are the two elements
of P. What familiar equivalence relation is R?

6. Consider the partition P = {{0}, {−1,1}, {−2,2}, {−3,3}, {−4,4}, ...} of ℤ. Describe
the equivalence relation whose equivalence classes are the elements of P.




11.5 The Integers Modulo n 


Example 11.8 proved that for any given n ∈ ℕ, the relation ≡ (mod n) is
reflexive, symmetric and transitive, so it is an equivalence relation. This is
a particularly significant equivalence relation in mathematics, and in the
present section we deduce some of its properties.

To make matters simpler, let’s pick a concrete n, say n = 5. Let’s begin
by looking at the equivalence classes of the relation ≡ (mod 5). There are
five equivalence classes, as follows:


[0] = {x ∈ ℤ : x ≡ 0 (mod 5)} = {x ∈ ℤ : 5 | (x − 0)} = {..., −10, −5, 0, 5, 10, 15, ...},
[1] = {x ∈ ℤ : x ≡ 1 (mod 5)} = {x ∈ ℤ : 5 | (x − 1)} = {..., −9, −4, 1, 6, 11, 16, ...},
[2] = {x ∈ ℤ : x ≡ 2 (mod 5)} = {x ∈ ℤ : 5 | (x − 2)} = {..., −8, −3, 2, 7, 12, 17, ...},
[3] = {x ∈ ℤ : x ≡ 3 (mod 5)} = {x ∈ ℤ : 5 | (x − 3)} = {..., −7, −2, 3, 8, 13, 18, ...},
[4] = {x ∈ ℤ : x ≡ 4 (mod 5)} = {x ∈ ℤ : 5 | (x − 4)} = {..., −6, −1, 4, 9, 14, 19, ...}.


Notice how these equivalence classes form a partition of the set ℤ. We
label the five equivalence classes as [0], [1], [2], [3] and [4], but you know of
course that there are other ways to label them. For example, [0] = [5] = [10] =
[15], and so on; and [1] = [6] = [−4], etc. Still, for this discussion we denote
the five classes as [0], [1], [2], [3] and [4].

These five classes form a set, which we shall denote as ℤ₅. Thus


ℤ₅ = {[0], [1], [2], [3], [4]}


is a set of five sets. The interesting thing about ℤ₅ is that even though its
elements are sets (and not numbers), it is possible to add and multiply them.
In fact, we can define the following rules that tell how elements of ℤ₅ can
be added and multiplied.


[a] + [b] = [a + b]
[a]·[b] = [a·b]


For example, [2] + [1] = [2 + 1] = [3], and [2]·[2] = [2·2] = [4]. We stress that
in doing this we are adding and multiplying sets (more precisely equivalence
classes), not numbers. We added (or multiplied) two elements of ℤ₅ and
obtained another element of ℤ₅.

Here is a trickier example. Observe that [2] + [3] = [5]. This time we
added elements [2], [3] ∈ ℤ₅, and got the element [5] ∈ ℤ₅. That was easy,
except where is our answer [5] in the set ℤ₅ = {[0], [1], [2], [3], [4]}? Since
[5] = [0], it is more appropriate to write [2] + [3] = [0].




In a similar vein, [2]·[3] = [6] would be written as [2]·[3] = [1] because
[6] = [1]. Test your skill with this by verifying the following addition and
multiplication tables for ℤ₅.


 +  | [0] [1] [2] [3] [4]          ·  | [0] [1] [2] [3] [4]
----+--------------------         ----+--------------------
[0] | [0] [1] [2] [3] [4]         [0] | [0] [0] [0] [0] [0]
[1] | [1] [2] [3] [4] [0]         [1] | [0] [1] [2] [3] [4]
[2] | [2] [3] [4] [0] [1]         [2] | [0] [2] [4] [1] [3]
[3] | [3] [4] [0] [1] [2]         [3] | [0] [3] [1] [4] [2]
[4] | [4] [0] [1] [2] [3]         [4] | [0] [4] [3] [2] [1]


We call the set ℤ₅ = {[0], [1], [2], [3], [4]} the integers modulo 5. As our
tables suggest, ℤ₅ is more than just a set: It is a little number system with
its own addition and multiplication. In this way it is like the familiar set ℤ
which also comes equipped with an addition and a multiplication.
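
One way to check such tables is to generate them. The Python sketch below is an illustration, not the book's method: it represents the class [a] by the integer a with 0 ≤ a ≤ 4 and uses Python's % operator to reduce sums and products, and the names add_table, mul_table and show are hypothetical.

```python
n = 5

# Entry [a][b] holds the representative of [a] + [b] (or [a]·[b]) in 0..n-1.
add_table = [[(a + b) % n for b in range(n)] for a in range(n)]
mul_table = [[(a * b) % n for b in range(n)] for a in range(n)]

def show(table, symbol):
    """Print a table with bracketed class labels on the top and left."""
    header = f"{symbol:>3} | " + " ".join(f"[{b}]" for b in range(len(table)))
    print(header)
    print("-" * len(header))
    for a, row in enumerate(table):
        print(f"[{a}] | " + " ".join(f"[{v}]" for v in row))

show(add_table, "+")
print()
show(mul_table, "*")
```

Changing n produces the corresponding tables for other moduli.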

Of course, there is nothing special about the number 5. We can also 
define ℤₙ for any natural number n. Here is the definition:


Definition 11.6 Let n ∈ ℕ. The equivalence classes of the equivalence
relation ≡ (mod n) are [0], [1], [2], ..., [n − 1]. The integers modulo n is
the set ℤₙ = {[0], [1], [2], ..., [n − 1]}. Elements of ℤₙ can be added by the
rule [a] + [b] = [a + b] and multiplied by the rule [a]·[b] = [ab].


Given a natural number n, the set ℤₙ is a number system containing n
elements. It has many of the algebraic properties that ℤ, ℝ and ℚ possess.
For example, it is probably obvious to you already that elements of ℤₙ obey
the commutative laws [a] + [b] = [b] + [a] and [a]·[b] = [b]·[a]. You can also
verify the distributive law [a]·([b] + [c]) = [a]·[b] + [a]·[c], as follows:


[a]·([b] + [c]) = [a]·[b + c]
                = [a(b + c)]
                = [ab + ac]
                = [ab] + [ac]
                = [a]·[b] + [a]·[c].


The integers modulo n are significant because they more closely fit certain
applications than do other number systems such as ℤ or ℝ. If you go on to
take a course in abstract algebra, then you will work extensively with ℤₙ as
well as other, more exotic, number systems. (In such a course you will also
use all of the proof techniques that we have discussed, as well as the ideas
of equivalence relations.)

To close this section we take up an issue that may have bothered you 
earlier. It has to do with our definitions of addition [a] + [b] = [a + b] and
multiplication [a]·[b] = [ab]. These definitions define addition and
multiplication of equivalence classes in terms of representatives a and b in the
equivalence classes. Since there are many different ways to choose such
representatives, we may well wonder if addition and multiplication are
consistently defined. For example, suppose two people, Alice and Bob, want
to multiply the elements [2] and [3] in ℤ₅. Alice does the calculation as
[2]·[3] = [6] = [1], so her final answer is [1]. Bob does it differently. Since
[2] = [7] and [3] = [8], he works out [2]·[3] as [7]·[8] = [56]. Since 56 ≡ 1 (mod 5),
Bob’s answer is [56] = [1], and that agrees with Alice’s answer. Will their
answers always agree or did they just get lucky (with the arithmetic)?

The fact is that no matter how they do the multiplication in ℤₙ, their
answers will agree. To see why, suppose Alice and Bob want to multiply
the elements [a], [b] ∈ ℤₙ, and suppose [a] = [a′] and [b] = [b′]. Alice and Bob
do the multiplication as follows:


Alice: [a]·[b] = [ab],
Bob: [a′]·[b′] = [a′b′].


We need to show that their answers agree, that is, we need to show [ab] =
[a′b′]. Since [a] = [a′], we know by Theorem 11.1 that a ≡ a′ (mod n). Thus
n | (a − a′), so a − a′ = nk for some integer k. Likewise, as [b] = [b′], we know
b ≡ b′ (mod n), or n | (b − b′), so b − b′ = nℓ for some integer ℓ. Thus we get
a = a′ + nk and b = b′ + nℓ. Therefore:


ab = (a′ + nk)(b′ + nℓ)
   = a′b′ + a′nℓ + nkb′ + n²kℓ.


Hence ab − a′b′ = n(a′ℓ + kb′ + nkℓ). This means n | (ab − a′b′), so ab ≡ a′b′
(mod n), and from that we conclude [ab] = [a′b′]. Consequently Alice and
Bob really do get the same answer, so we can be assured that the definition
of multiplication in ℤₙ is consistent.
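
The argument above can be spot-checked by brute force. The following Python sketch is only an illustration under stated assumptions: representatives are drawn from a finite window of integers, and the names same_class and multiplication_is_consistent are hypothetical. A finite check of this kind supports the proof but is not a substitute for it.

```python
def same_class(x, y, n):
    """True when [x] = [y] in the integers modulo n, i.e. n divides x - y."""
    return (x - y) % n == 0

def multiplication_is_consistent(n, window=range(-10, 11)):
    """Check that [ab] = [a'b'] whenever [a] = [a'] and [b] = [b'],
    for all representatives a, a', b, b' taken from the window."""
    return all(
        same_class(a * b, a2 * b2, n)
        for a in window for a2 in window if same_class(a, a2, n)
        for b in window for b2 in window if same_class(b, b2, n)
    )

# Alice multiplies 2 and 3; Bob multiplies the representatives 7 and 8.
alice = (2 * 3) % 5
bob = (7 * 8) % 5
print(alice, bob, multiplication_is_consistent(5))
```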

Exercise 8 (below) asks you to prove that addition in ℤₙ is similarly
consistent. 




Exercises for Section 11.5 


1. Write the addition and multiplication tables for ℤ₂.
2. Write the addition and multiplication tables for ℤ₃.
3. Write the addition and multiplication tables for ℤ₄.
4. Write the addition and multiplication tables for ℤ₆.
5. Suppose [a], [b] ∈ ℤ₅ and [a]·[b] = [0]. Is it necessarily true that either [a] = [0] or
[b] = [0]?
6. Suppose [a], [b] ∈ ℤ₆ and [a]·[b] = [0]. Is it necessarily true that either [a] = [0] or
[b] = [0]? What if [a], [b] ∈ ℤ₇?
7. Do the following calculations in ℤ₉, in each case expressing your answer as [a]
with 0 ≤ a ≤ 8.
(a) [8] + [8] (b) [24] + [11] (c) [21]·[15] (d) [8]·[8]
8. Suppose [a], [b] ∈ ℤₙ, and [a] = [a′] and [b] = [b′]. Alice adds [a] and [b] as [a] + [b] =
[a + b]. Bob adds them as [a′] + [b′] = [a′ + b′]. Show that their answers [a + b] and
[a′ + b′] are the same.


11.6 Relations Between Sets 


In the beginning of this chapter, we defined a relation on a set A to be a
subset R ⊆ A × A. This created a framework that could model any situation
in which elements of A are compared to themselves. In this setting, the
statement xRy has elements x and y from A on either side of the R because R
compares elements from A. But there are other relational symbols that don’t
work this way. Consider ∈. The statement 5 ∈ ℤ expresses a relationship
between 5 and ℤ (namely that the element 5 is in the set ℤ) but 5 and
ℤ are not in any way naturally regarded as both elements of some set A.
To overcome this difficulty, we generalize the idea of a relation on A to a
relation from A to B.


Definition 11.7 A relation from a set A to a set B is a subset R ⊆ A × B.
We often abbreviate the statement (x, y) ∈ R as xRy. The statement (x, y) ∉
R is abbreviated as x R̸ y.


Example 11.17 Suppose A = {1,2} and B = 𝒫(A) = {∅, {1}, {2}, {1,2}}. Then
R = {(1,{1}), (2,{2}), (1,{1,2}), (2,{1,2})} ⊆ A × B is a relation from A to B. Note
that 1R{1}, 2R{2}, 1R{1,2} and 2R{1,2}. The relation R is the familiar
relation ∈ for the set A, that is, xRX means exactly the same thing as x ∈ X.
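
The relation of Example 11.17 can be built directly from its description. This Python sketch is illustrative only: power_set is a hypothetical helper written for the occasion, and subsets are stored as frozensets because ordinary Python sets cannot be elements of other sets.

```python
from itertools import combinations

def power_set(A):
    """All subsets of A, each stored as a frozenset."""
    items = sorted(A)
    return {
        frozenset(c)
        for r in range(len(items) + 1)
        for c in combinations(items, r)
    }

A = {1, 2}
B = power_set(A)

# The relation "is an element of", viewed as a subset of A × B.
R = {(x, X) for x in A for X in B if x in X}

for x, X in sorted(R, key=lambda pair: (pair[0], sorted(pair[1]))):
    print(x, "R", set(X))
```

The four pairs printed are exactly those listed in the example.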




Diagrams for relations from A to B differ from diagrams for relations 
on A. Since there are two sets A and B in a relation from A to B, we have 
to draw labeled nodes for each of the two sets. Then we draw arrows from x 
to y whenever xR y. The following figure illustrates this for Example 11.17. 


Figure 11.3. A relation from A to B 


The ideas from this chapter show that any relation (whether it is a 
familiar one like ≥, ≤, =, |, ∈ or ⊆, or a more exotic one) is really just a
set. Therefore the theory of relations is a part of the theory of sets. In 
the next chapter, we will see that this idea touches on another important 
mathematical construction, namely functions. We will define a function to 
be a special kind of relation from one set to another, and in this context we 
will see that any function is really just a set. 




CHAPTER 12 


Functions 


You know from algebra and calculus that functions play a fundamental
role in mathematics. You may view a function as a kind of formula that 
describes a relationship between two (or more) quantities. You certainly 
understand and appreciate the fact that relationships between quantities 
are central to all scientific disciplines, so you do not need to be convinced 
that functions are important. Still, you may not be fully aware of the 
significance of functions. Functions are more than merely descriptions of 
numeric relationships. In a more general sense, functions can compare and 
relate different kinds of mathematical structures. You will see this as your 
understanding of mathematics deepens. In preparation for this, we will now
explore a more general and versatile view of functions. 

The concept of a relation between sets (Definition 11.7) plays a big role 
here, so you may want to quickly review it. 


12.1 Functions 


Let’s start on familiar ground. Consider the function f(x) = x² from ℝ to ℝ.
Its graph is the set of points R = {(x, x²) : x ∈ ℝ} ⊆ ℝ × ℝ.


Figure 12.1. A familiar function 


Having read Chapter 11, you may see f in a new light. Its graph R ⊆ ℝ × ℝ
is a relation on the set ℝ. In fact, as we shall see, functions are just special
kinds of relations. Before stating the exact definition, we look at another
example. Consider the function f(n) = |n| + 2 that converts integers n into
natural numbers |n| + 2. Its graph is R = {(n, |n| + 2) : n ∈ ℤ} ⊆ ℤ × ℕ.


Figure 12.2. The function f : ℤ → ℕ, where f(n) = |n| + 2


Figure 12.2 shows the graph R as darkened dots in the grid of points ℤ × ℕ.
Notice that in this example R is not a relation on a single set. The set of
input values ℤ is different from the set ℕ of output values, so the graph
R ⊆ ℤ × ℕ is a relation from ℤ to ℕ.

This example illustrates three things. First, a function can be viewed as 
sending elements from one set A to another set B. (In the case of f, A = ℤ
and B = ℕ.) Second, such a function can be regarded as a relation from A to
B. Third, for every input value n, there is exactly one output value f(n). In 
your high school algebra course, this was expressed by the vertical line test: 
Any vertical line intersects a function’s graph at most once. It means that 
for any input value x, the graph contains exactly one point of form (x, f(x)). 
Our main definition, given below, incorporates all of these ideas. 


Definition 12.1 Suppose A and B are sets. A function f from A to B
(denoted as f : A → B) is a relation f ⊆ A × B from A to B, satisfying the
property that for each a ∈ A the relation f contains exactly one ordered
pair of form (a,b). The statement (a,b) ∈ f is abbreviated f(a) = b.


Example 12.1 Consider the function f : ℤ → ℕ graphed in Figure 12.2.
According to Definition 12.1, we regard f as the set of points in its graph,
that is, f = {(n, |n| + 2) : n ∈ ℤ} ⊆ ℤ × ℕ. This is a relation from ℤ to ℕ, and
indeed given any a ∈ ℤ the set f contains exactly one ordered pair (a, |a| + 2)
whose first coordinate is a. Since (1,3) ∈ f, we write f(1) = 3; and since
(−3,5) ∈ f we write f(−3) = 5, etc. In general, (a,b) ∈ f means that f sends
the input value a to the output value b, and we express this as f(a) = b. This
function can be expressed by a formula: For each input value n, the output
value is |n| + 2, so we may write f(n) = |n| + 2. All this agrees with the way
we thought of functions in algebra and calculus; the only difference is that
now we also think of a function as a relation.


Definition 12.2 For a function f : A → B, the set A is called the domain
of f. (Think of the domain as the set of possible “input values” for f.) The
set B is called the codomain of f. The range of f is the set {f(a) : a ∈ A}
= {b : (a,b) ∈ f}. (Think of the range as the set of all possible “output
values” for f. Think of the codomain as a sort of “target” for the outputs.)


Consider the function f : ℤ → ℕ, where f(n) = |n| + 2, from Example 12.1.
The domain is ℤ and the codomain is ℕ. The range of this function is the
set {f(a) : a ∈ ℤ} = {|a| + 2 : a ∈ ℤ} = {2, 3, 4, 5, ...}. Notice that the range is a
subset of the codomain ℕ, but it does not (in this case) equal the codomain.
In general, the range of a function is a subset of the codomain. In this sense
the codomain could have been any set that contains the range. We might
just as well have said that this f is a function f : ℤ → ℤ, or even f : ℤ → ℝ.

This illustrates an important point: the codomain of a function is not
an intrinsic feature of the function; it is more a matter of choice or context.
In Example 12.1 we chose ℕ as the codomain because all the output values
of f are natural numbers. But in general, the codomain of a function can
be any set that contains the function’s range as a subset.

In our examples so far, the domains and codomains are sets of numbers, 
but this needn’t be the case in general, as the next example indicates. 


Example 12.2 Let A = {p,q,r,s} and B = {0,1,2}, and
f = {(p,0), (q,1), (r,2), (s,2)} ⊆ A × B.


This is a function f : A → B because each element of A occurs exactly once
as a first coordinate of an ordered pair in f. Observe that we have f(p) = 0,
f(q) = 1, f(r) = 2 and f(s) = 2. The domain of this function is A = {p,q,r,s}.
The codomain and range are both B = {0,1,2}.
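
For finite sets, the condition in Definition 12.1 is easy to test mechanically. The sketch below is an illustration only: is_function is a hypothetical helper, and the relations g and h are made-up non-examples added for contrast; they do not appear in the text.

```python
def is_function(f, A, B):
    """True when f ⊆ A × B and each a in A is the first coordinate
    of exactly one ordered pair in f (Definition 12.1)."""
    if not all(a in A and b in B for a, b in f):
        return False
    return all(sum(1 for (x, _) in f if x == a) == 1 for a in A)

A = {"p", "q", "r", "s"}
B = {0, 1, 2}
f = {("p", 0), ("q", 1), ("r", 2), ("s", 2)}

range_of_f = {b for _, b in f}

# Two relations from A to B that are not functions (illustrative):
g = {("p", 0), ("p", 1), ("q", 1), ("r", 2), ("s", 2)}  # p is sent to two outputs
h = {("p", 0), ("q", 1), ("r", 2)}                      # s is sent nowhere

print(is_function(f, A, B), is_function(g, A, B), is_function(h, A, B))
print(range_of_f == B)
```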

If A and B are not both sets of numbers (as in this example), it can be 
difficult to draw a graph of f : A → B in the traditional sense. Figure 12.3(a)
is an attempt at a graph of f. The sets A and B are aligned as x- and y-axes,
and the Cartesian product A × B is filled in accordingly. The subset f ⊆ A × B
is indicated with dashed lines, and we can regard it as a “graph” of f.
Figure 12.3(b) shows a more natural visual description of f. The sets A and 
B are drawn side-by-side, and arrows point from a to b whenever f(a) = b. 


Figure 12.3. Two ways of drawing the function f = {(p,0), (q,1), (r,2), (s,2)}: (a) as a subset of A × B; (b) with arrows from A to B


In general, if f : A → B is the kind of function you may have encountered
in algebra or calculus, then conventional graphing techniques offer the best 
visual description of it. On the other hand, if A and B are finite or if we are 
thinking of them as generic sets, then describing f with arrows is often a 
more appropriate way of visualizing it. 

We emphasize that, according to Definition 12.1, a function is really 
just a special kind of set. Any function f : A → B is a subset of A × B. By
contrast, your calculus text probably defined a function as a certain kind of 
“rule.” While that intuitive outlook is adequate for the first few semesters 
of calculus, it does not hold up well to the rigorous mathematical standards 
necessary for further progress. The problem is that words like “rule” are 
too vague. Defining a function as a set removes the ambiguity. It makes a 
function into a concrete mathematical object. 

Still, in practice we tend to think of functions as rules. Given f : ℤ → ℕ
where f(n) = |n| + 2, we think of this as a rule that associates any number
n ∈ ℤ to the number |n| + 2 in ℕ, rather than a set containing ordered pairs
(n, |n| + 2). It is only when we have to understand or interpret the theoretical
nature of functions (as we do in this text) that Definition 12.1 comes to bear.
The definition is a foundation that gives us license to think about functions
in a more informal way. For instance, suppose we are discussing a set of
functions, such as the set S of all functions ℝ → ℝ. Without Definition 12.1,
it would be unclear just what kinds of objects the elements of S are. But
with Definition 12.1, we know exactly what the elements are: each element
of S is a subset f ⊆ ℝ × ℝ. So we are free to think of S as a collection of “rules,”
and can fall back on Definition 12.1 when greater scrutiny is required.




The next example brings up a point about notation. Consider a function
such as f : ℤ² → ℤ, whose domain is a Cartesian product. This function takes
as input an ordered pair (m,n) ∈ ℤ² and sends it to a number f((m,n)) ∈ ℤ.
To simplify the notation, it is common to write f(m,n) instead of f((m,n)),
even though this is like writing fx instead of f(x). We also remark that
although we’ve been using the letters f, g and h to denote functions, any
other reasonable symbol could be used. Greek letters such as φ and θ are
common.


Example 12.3 Say a function φ : ℤ² → ℤ is defined as φ(m,n) = 6m − 9n.
Note that as a set, this function is φ = {((m,n), 6m − 9n) : (m,n) ∈ ℤ²} ⊆ ℤ² × ℤ.
What is the range of φ?


To answer this, first observe that for any (m,n) ∈ ℤ², the value φ(m,n) =
6m − 9n = 3(2m − 3n) is a multiple of 3. Thus every number in the range is
a multiple of 3, so the range is a subset of the set of all multiples of 3. On
the other hand if b = 3k is a multiple of 3 we have φ(−k,−k) = 6(−k) − 9(−k) =
3k = b, which means any multiple of 3 is in the range of φ. Therefore the
range of φ is the set {3k : k ∈ ℤ} of all multiples of 3.
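
Both halves of this argument can be checked numerically on a finite window of inputs. The Python sketch below is an illustration only: the function is named phi, and the window of integers from −20 to 20 is an arbitrary choice, so the check is evidence for the claim rather than a proof of it.

```python
def phi(m, n):
    """The function of Example 12.3, written as a rule."""
    return 6 * m - 9 * n

window = range(-20, 21)

# Every output is a multiple of 3 ...
values = {phi(m, n) for m in window for n in window}
all_multiples_of_3 = all(v % 3 == 0 for v in values)

# ... and every multiple 3k is attained, namely at the input (-k, -k).
every_multiple_hit = all(phi(-k, -k) == 3 * k for k in window)

print(all_multiples_of_3, every_multiple_hit)
```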


To conclude this section, let’s use Definition 12.1 to help us understand
what it means for two functions f : A → B and g : C → D to be equal. The
definition says f and g are subsets f ⊆ A × B and g ⊆ C × D. It makes sense
to say that f and g are equal if f = g, that is, if they are equal as sets.

Thus the two functions f = {(1,a), (2,a), (3,b)} and g = {(3,b), (2,a), (1,a)}
are equal because the sets f and g are equal. Notice that the domain of
both functions is A = {1,2,3}, the set of first elements x in the ordered pairs
(x,y) ∈ f = g. In general, equal functions must have equal domains.

Observe also that the equality f = g means f(x) = g(x) for every x ∈ A.
We repackage these ideas in the following definition.


Definition 12.3 Two functions f : A → B and g : A → D are equal if
f = g (as sets). Equivalently, f = g if and only if f(x) = g(x) for every x ∈ A.


Observe that f and g can have different codomains and still be equal. 
Consider the functions f : Z → N and g : Z → Z defined as f(x) = |x| + 2 and
g(x) = |x| + 2. Even though their codomains are different, the functions are
equal because f(x) = g(x) for every x in the domain. If you are bothered that 
these equal functions have different codomains, recall that we remarked 
on page 225 that a function’s codomain is not really an intrinsic feature of 
the function, but more a matter of convenience. (Any set that contains the 
range as a subset is a valid choice of codomain.) 
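
For functions on finite sets, Definition 12.3 can be tested directly by storing each function as a set of ordered pairs. The sketch below is only an illustration; the helper names domain and value_at are hypothetical and not part of the text.

```python
# Finite functions stored as sets of ordered pairs, as in Definition 12.1.
f = {(1, "a"), (2, "a"), (3, "b")}
g = {(3, "b"), (2, "a"), (1, "a")}

def domain(func):
    """Set of first coordinates of the ordered pairs."""
    return {x for (x, _) in func}

def value_at(func, x):
    """Return the unique y with (x, y) in func."""
    (y,) = {b for (a, b) in func if a == x}
    return y

# Equality as sets, and the equivalent pointwise condition.
equal_as_sets = (f == g)
equal_pointwise = domain(f) == domain(g) and all(
    value_at(f, x) == value_at(g, x) for x in domain(f)
)
print(equal_as_sets, equal_pointwise)
```

Notice that the codomain never enters the comparison, which matches the remark that equal functions may have different codomains.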




Exercises for Section 12.1 
1. Suppose A = {0,1,2,3,4}, B = {2,3,4,5} and f = {(0,3),(1,3),(2,4),(3,2),(4,2)}. State
the domain and range of f. Find f(2) and f(1).
2. Suppose A = {a,b,c,d}, B = {2,3,4,5,6} and f = {(a,2),(b,3),(c,4),(d,5)}. State the
domain and range of f. Find f(b) and f(d).
3. There are four different functions f : {a,b} → {0,1}. List them. Diagrams suffice.
4. There are eight different functions f : {a,b,c} → {0,1}. List them. Diagrams
suffice.
5. Give an example of a relation from {a,b,c,d} to {d,e} that is not a function.
6. Suppose f : Z → Z is defined as f = {(x, 4x+5) : x ∈ Z}. State the domain, codomain
and range of f. Find f(10).
7. Consider the set f = {(x,y) ∈ Z × Z : 3x + y = 4}. Is this a function from Z to Z?
Explain.
8. Consider the set f = {(x,y) ∈ Z × Z : x + 3y = 4}. Is this a function from Z to Z?
Explain.
9. Consider the set f = {(x², x) : x ∈ R}. Is this a function from R to R? Explain.
10. Consider the set f = {(x³, x) : x ∈ R}. Is this a function from R to R? Explain.
11. Is the set θ = {(X, |X|) : X ⊆ Z₅} a function? If so, what is its domain and range?
12. Is the set θ = {((x,y), (3y, 2x, x+y)) : x, y ∈ R} a function? If so, what is its domain
and range? What can be said about the codomain?


12.2 Injective and Surjective Functions 


You may recall from algebra and calculus that a function may be one-to-one 
and onto, and these properties are related to whether or not the function is 
invertible. We now review these important ideas. In advanced mathematics, 
the word injective is often used instead of one-to-one, and surjective is used 
instead of onto. Here are the exact definitions: 


Definition 12.4 A function f : A → B is:
1. injective (or one-to-one) if for all a, a′ ∈ A, a ≠ a′ implies f(a) ≠ f(a′);
2. surjective (or onto B) if for every b ∈ B there is an a ∈ A with f(a) = b;
3. bijective if f is both injective and surjective.


Below is a visual description of Definition 12.4. In essence, injective 
means that unequal elements in A always get sent to unequal elements 
in B. Surjective means that every element of B has an arrow pointing to it, 
that is, it equals f(a) for some a in the domain of f. 




[Figure: arrow diagrams for Definition 12.4. Injective means that any two
a, a′ ∈ A are sent to different elements of B, and not to the same element.
Surjective means that any b ∈ B has an arrow pointing to it.]


For more concrete examples, consider the following functions f, g : R → R.
The function f(x) = x² is not injective because −2 ≠ 2, but f(−2) = f(2). Nor
is it surjective, for if b = −1 (or if b is any negative number), then there is
no a ∈ R with f(a) = b. On the other hand, g(x) = x³ is both injective and
surjective, so it is also bijective.

Notice that whether or not f is surjective depends on its codomain. For
example, f(x) = x² is not surjective as a function R → R, but it is surjective
as a function R → [0,∞). When we speak of a function being surjective, we
always have in mind a particular codomain.

There are four possible injective/surjective combinations that a function 
may possess. This is illustrated below for four functions A → B. Functions
in the first column are injective, those in the second column are not injective. 
Functions in the first row are surjective, those in the second row are not. 


[Figure: a two-by-two grid of arrow diagrams. Columns: Injective, Not injective.
Rows: Surjective, Not surjective.]


We note in passing that, according to the definitions, a function is 
surjective if and only if its codomain equals its range. 
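
For a function on a finite set, both properties can be tested mechanically. The following sketch assumes the function is stored as a Python dict from domain elements to values; the helper names are hypothetical, and the finite domain is chosen only for illustration.

```python
def is_injective(f):
    """Injective: no two domain elements share a value."""
    return len(set(f.values())) == len(f)

def is_surjective(f, codomain):
    """Surjective: the range equals the chosen codomain."""
    return set(f.values()) == set(codomain)

def is_bijective(f, codomain):
    return is_injective(f) and is_surjective(f, codomain)

# f(x) = x^2 restricted to a small finite domain (an illustrative choice).
square = {x: x * x for x in (-2, -1, 0, 1, 2)}
print(is_injective(square))                 # False, since f(-2) = f(2)
print(is_surjective(square, {0, 1, 4}))     # True: codomain equals range
print(is_surjective(square, {0, 1, 2, 4}))  # False: nothing is sent to 2
```

The last two lines show the same function being surjective for one codomain and not for another, which is why a codomain must be specified.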




Often it is necessary to prove that a particular function f : A → B is
injective. For this, Definition 12.4 says we must prove that for any two
elements a, a′ ∈ A, the conditional statement (a ≠ a′) ⇒ (f(a) ≠ f(a′)) is true.
The two main approaches for this are summarized below.


How to show a function f : A → B is injective:

Direct approach:
Suppose a, a′ ∈ A and a ≠ a′.
Therefore f(a) ≠ f(a′).

Contrapositive approach:
Suppose a, a′ ∈ A and f(a) = f(a′).
Therefore a = a′.


Of these two approaches, the contrapositive is often the easiest to use, 
especially if f is defined by an algebraic formula. This is because the 
contrapositive approach starts with the equation f(a) = f(a′) and proceeds
to the equation a = a′. In algebra, as you know, it is usually easier to work
with equations than inequalities. 

To prove that a function is not injective, you must disprove the statement
(a ≠ a′) ⇒ (f(a) ≠ f(a′)). For this it suffices to find an example of two elements
a, a′ ∈ A for which a ≠ a′ and f(a) = f(a′).

Next we examine how to prove that f : A → B is surjective. According
to Definition 12.4, we must prove the statement ∀b ∈ B, ∃a ∈ A, f(a) = b. In
words, we must show that for any b ∈ B, there is at least one a ∈ A (which
may depend on b) having the property that f(a) = b. Here is an outline:


How to show a function f : A → B is surjective:

Suppose b ∈ B.
[Prove there exists a ∈ A for which f(a) = b.]


The second line involves proving the existence of an a for which f(a) = b.
For this, just finding an example of such an a would suffice. (How to find
such an example depends on how f is defined. If f is given as a formula,
we may be able to find a by solving the equation f(a) = b for a. Sometimes
you can find a by just plain common sense.) To show f is not surjective, we
must prove the negation of ∀b ∈ B, ∃a ∈ A, f(a) = b, that is, we must prove
∃b ∈ B, ∀a ∈ A, f(a) ≠ b.

The following examples illustrate these ideas. (For the first example, 
note that the set R − {0} is R with the number 0 removed.)




Example 12.4 Show that the function f : R − {0} → R defined as f(x) = 1/x + 1
is injective but not surjective.

We will use the contrapositive approach to show that f is injective.
Suppose a, a′ ∈ R − {0} and f(a) = f(a′). This means 1/a + 1 = 1/a′ + 1. Subtracting 1
from both sides and inverting produces a = a′. Therefore f is injective.

The function f is not surjective because there exists an element b = 1 ∈ R
for which f(x) = 1/x + 1 ≠ 1 for every x ∈ R − {0}.


Example 12.5 Show that the function f : R − {0} → R − {1} where f(x) = 1/x + 1
is injective and surjective (hence bijective).

This is just like the previous example, except that the codomain has
been changed. The previous example shows f is injective. To show that it
is surjective, take an arbitrary b ∈ R − {1}. We seek an a ∈ R − {0} for which
f(a) = b, that is, for which 1/a + 1 = b. Solving for a gives a = 1/(b − 1), which is
defined because b ≠ 1. In summary, for any b ∈ R − {1}, we have f(1/(b − 1)) = b,
so f is surjective.
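
The solving step can be checked with exact arithmetic. This is a minimal sketch assuming f(x) = 1/x + 1 as in the example; it uses Python's Fraction type, so it only samples rational values of b, and the name solve_for_a is hypothetical.

```python
from fractions import Fraction

def f(x):
    """f(x) = 1/x + 1 for nonzero x, computed exactly."""
    return 1 / Fraction(x) + 1

def solve_for_a(b):
    """The candidate a = 1/(b - 1) found by solving f(a) = b; needs b != 1."""
    return 1 / (Fraction(b) - 1)

# A few sample targets b in the codomain R - {1}.
samples = [Fraction(2), Fraction(-3), Fraction(1, 2), Fraction(7, 5), Fraction(0)]
all_hit = all(f(solve_for_a(b)) == b for b in samples)
print(all_hit)
```

Each sample b is reached by the input a = 1/(b − 1), exactly as the surjectivity argument requires.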


Example 12.6 Show that the function g : Z × Z → Z × Z defined by the
formula g(m,n) = (m+n, m+2n), is both injective and surjective.

We will use the contrapositive approach to show that g is injective.
Thus we need to show that g(m,n) = g(k,ℓ) implies (m,n) = (k,ℓ). Suppose
(m,n), (k,ℓ) ∈ Z × Z and g(m,n) = g(k,ℓ). Then (m+n, m+2n) = (k+ℓ, k+2ℓ). It
follows that m+n = k+ℓ and m+2n = k+2ℓ. Subtracting the first equation
from the second gives n = ℓ. Next, subtract n = ℓ from m+n = k+ℓ to get
m = k. Since m = k and n = ℓ, it follows that (m,n) = (k,ℓ). Thus g is injective.

To see that g is surjective, consider an arbitrary element (b,c) ∈ Z × Z.
We need to show that there is some (x,y) ∈ Z × Z for which g(x,y) = (b,c). To
find (x,y), note that g(x,y) = (b,c) means (x+y, x+2y) = (b,c). This leads to
the following system of equations:

x + y = b
x + 2y = c.

Solving gives x = 2b − c and y = c − b. Then (x,y) = (2b − c, c − b). We now
have g(2b − c, c − b) = (b,c), and it follows that g is surjective.


Example 12.7 Consider the function h : Z × Z → Q defined as h(m,n) = m/(|n| + 1).
Determine whether this is injective and whether it is surjective.

This function is not injective because of the unequal elements (1,2) and
(1,−2) in Z × Z for which h(1,2) = h(1,−2) = 1/3. However, h is surjective: Take
any element b ∈ Q. Then b = c/d for some c, d ∈ Z. Notice we may assume d is
positive by making c negative, if necessary. Then h(c, d−1) = c/(|d−1| + 1) = c/d = b.
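
Both claims in this example can be tested with exact rational arithmetic. The sketch below assumes h(m,n) = m/(|n| + 1), the formula consistent with the values h(1,2) = h(1,−2) = 1/3 and h(c, d−1) = c/d computed in the example; the helper name preimage_of is hypothetical.

```python
from fractions import Fraction

def h(m, n):
    """h(m, n) = m / (|n| + 1), computed as an exact rational number."""
    return Fraction(m, abs(n) + 1)

# Not injective: two unequal inputs share the value 1/3.
same_value = h(1, 2) == h(1, -2) == Fraction(1, 3)

def preimage_of(b):
    """Return (c, d - 1), where b = c/d with d > 0 (Fraction keeps d positive)."""
    b = Fraction(b)
    return (b.numerator, b.denominator - 1)

# Surjective: each sample rational b is hit by the input (c, d - 1).
samples = [Fraction(3, 4), Fraction(-7, 2), Fraction(0), Fraction(5)]
all_hit = all(h(*preimage_of(b)) == b for b in samples)
print(same_value, all_hit)
```

Python's Fraction stores every rational with a positive denominator, which plays the role of the step "we may assume d is positive."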




Exercises for Section 12.2 


1. Let A = {1,2,3,4} and B = {a,b,c}. Give an example of a function f : A → B that is
neither injective nor surjective.
2. Consider the logarithm function ln : (0,∞) → R. Decide whether this function is
injective and whether it is surjective.
3. Consider the cosine function cos : R → R. Decide whether this function is injective
and whether it is surjective. What if it had been defined as cos : R → [−1,1]?
4. A function f : Z → Z × Z is defined as f(n) = (2n, n+3). Verify whether this function
is injective and whether it is surjective.
5. A function f : Z → Z is defined as f(n) = 2n + 1. Verify whether this function is
injective and whether it is surjective.
6. A function f : Z × Z → Z is defined as f(m,n) = 3n − 4m. Verify whether this
function is injective and whether it is surjective.
7. A function f : Z × Z → Z is defined as f(m,n) = 2n − 4m. Verify whether this
function is injective and whether it is surjective.
8. A function f : Z × Z → Z × Z is defined as f(m,n) = (m+n, 2m+n). Verify whether
this function is injective and whether it is surjective.
9. Prove that the function f : R − {2} → R − {5} defined by f(x) = (5x+1)/(x−2)
is bijective.
10. Prove the function f : R − {1} → R − {1} defined by f(x) = ((x+1)/(x−1))³ is bijective.
11. Consider the function θ : {0,1} × N → Z defined as θ(a,b) = (−1)ᵃb. Is θ injective?
Is it surjective? Bijective? Explain.
12. Consider the function θ : {0,1} × N → Z defined as θ(a,b) = a − 2ab + b. Is θ injective?
Is it surjective? Bijective? Explain.
13. Consider the function f : R² → R² defined by the formula f(x,y) = (xy, x³). Is f
injective? Is it surjective? Bijective? Explain.
14. Consider the function θ : 𝒫(Z) → 𝒫(Z) defined as θ(X) = X̄. Is θ injective? Is it
surjective? Bijective? Explain.
15. This question concerns functions f : {A,B,C,D,E,F,G} → {1,2,3,4,5,6,7}. How
many such functions are there? How many of these functions are injective? How
many are surjective? How many are bijective?
16. This question concerns functions f : {A,B,C,D,E} → {1,2,3,4,5,6,7}. How many
such functions are there? How many of these functions are injective? How many
are surjective? How many are bijective?
17. This question concerns functions f : {A,B,C,D,E,F,G} → {1,2}. How many such
functions are there? How many of these functions are injective? How many are
surjective? How many are bijective?
18. Prove that the function f : N → Z defined as f(n) = ((−1)ⁿ(2n−1) + 1)/4 is bijective.
19. Prove that f : N × N → N defined as f(m,n) = 2^(m−1)(2n − 1) is bijective.




12.3 The Pigeonhole Principle Revisited 


We first encountered a result called the pigeonhole principle in Section 3.9. 
It turns out that the pigeonhole principle has a useful phrasing in the 
language of injective and surjective functions, and we now discuss this. Our 
discussion will not use any material from Chapter 3, so it does not matter 
if you skipped it. 

The pigeonhole principle is motivated by a simple thought experiment: 
Imagine there is a set A of pigeons and a set B of pigeonholes, and all 
the pigeons fly into the pigeonholes. You can think of this as describing a 
function f : A → B, where pigeon p flies into pigeonhole f(p). See Figure 12.4.


[Figure: two arrow diagrams of a function f from Pigeons to Pigeonholes,
labeled (a) and (b).]

Figure 12.4. The pigeonhole principle


In Figure 12.4(a) there are more pigeons than pigeonholes, and it is 
obvious that in such a case at least two pigeons have to fly into the same 
pigeonhole, meaning that f is not injective. In Figure 12.4(b) there are 
fewer pigeons than pigeonholes, so clearly at least one pigeonhole remains 
empty, meaning that f is not surjective. 

Although the underlying idea expressed by these figures has little to do 
with pigeons, it is nonetheless called the pigeonhole principle: 


The Pigeonhole Principle (function version)

Suppose A and B are finite sets and f : A → B is any function.
1. If |A| > |B|, then f is not injective.
2. If |A| < |B|, then f is not surjective.
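
For very small sets, the principle can be confirmed by listing every function A → B and testing each one. The following is a brute-force sketch; the set sizes and the helper names are arbitrary choices made for illustration.

```python
from itertools import product

def all_functions(A, B):
    """Yield every function A -> B, each stored as a dict."""
    A = list(A)
    for images in product(B, repeat=len(A)):
        yield dict(zip(A, images))

def is_injective(f):
    return len(set(f.values())) == len(f)

def is_surjective(f, B):
    return set(f.values()) == set(B)

# Case |A| > |B|: four pigeons, three pigeonholes.
pigeons, holes = range(4), range(3)
none_injective = not any(is_injective(f) for f in all_functions(pigeons, holes))

# Case |A| < |B|: two pigeons, three pigeonholes.
few, many = range(2), range(3)
none_surjective = not any(is_surjective(f, many) for f in all_functions(few, many))

print(none_injective, none_surjective)
```

There are 3⁴ = 81 functions in the first case and 3² = 9 in the second, and the search finds no injective function among the former and no surjective function among the latter.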


Though the pigeonhole principle is obvious, it can be used to prove some 
things that are not so obvious. Two examples follow. 




Proposition If A is any set of 10 integers between 1 and 100, then there
exist two different subsets X ⊆ A and Y ⊆ A for which the sum of elements
in X equals the sum of elements in Y.


To illustrate what this proposition is saying, consider the random set 
A= {5, 7, 12, 11, 17, 50, 51, 80, 90, 100} 


of 10 integers between 1 and 100. Notice that A has subsets X = {5,80} and 
Y ={7,11,17,50} for which the sum of the elements in X equals the sum of 
those in Y. If we tried to “mess up” A by changing the 5 to a 6, we get 


A = {6, 7, 12, 11, 17, 50, 51, 80, 90, 100} 


which has subsets X = {7,12,17,50} and Y = {6,80} both of whose elements 
add up to the same number (86). The proposition asserts that this is always 
possible, no matter what A is. Here is a proof: 


Proof. Suppose A ⊆ {1,2,3,4,...,99,100} and |A| = 10, as stated. Notice that
if X ⊆ A, then X has no more than 10 elements, each between 1 and 100,
and therefore the sum of all the elements of X is at most 100 · 10 = 1000.
Consider the function

f : 𝒫(A) → {0,1,2,3,4,...,1000},

where f(X) is the sum of the elements in X. (Examples: f({3,7,50}) = 60;
f({1,70,80,95}) = 246.) As |𝒫(A)| = 2¹⁰ = 1024 > 1001 = |{0,1,2,3,...,1000}|, it
follows from the pigeonhole principle that f is not injective. Therefore there
are two unequal sets X, Y ∈ 𝒫(A) for which f(X) = f(Y). In other words,
there are subsets X ⊆ A and Y ⊆ A for which the sum of elements in X
equals the sum of elements in Y. ∎
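
The proof guarantees that a collision exists but does not say where. For a particular set A, a pair of subsets can be found by searching, as in the following sketch. The function name equal_sum_subsets is hypothetical, and the search order is an arbitrary choice, so the pair it returns need not match the pairs named earlier.

```python
from itertools import combinations

def equal_sum_subsets(A):
    """Return two different subsets of A with equal element sums, or None."""
    seen = {}  # maps a sum to the first subset found with that sum
    elements = sorted(A)
    for size in range(len(elements) + 1):
        for subset in combinations(elements, size):
            total = sum(subset)
            if total in seen:
                return seen[total], subset
            seen[total] = subset
    return None

A = {5, 7, 12, 11, 17, 50, 51, 80, 90, 100}
X, Y = equal_sum_subsets(A)
print(X, Y, sum(X))
```

The dictionary seen plays the role of the pigeonholes: each possible sum is a hole, and the search stops the first time two subsets land in the same one.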


Proposition There are at least two Texans with the same number of 
hairs on their heads. 


Proof. We will use two facts. First, the population of Texas is more than 
twenty million. Second, it is a biological fact that every human head has 
fewer than one million hairs. Let A be the set of all Texans, and let B = 
{0,1,2,3,4,...,1000000}. Let f : A — B be the function for which f(x) equals 
the number of hairs on the head of x. Since |A| > |B|, the pigeonhole principle 
asserts that f is not injective. Thus there are two Texans x and y for whom 
f(x) = f(y), meaning that they have the same number of hairs on their 
heads. ∎




Proofs that use the pigeonhole principle tend to be inherently
non-constructive, in the sense discussed in Section 7.4. For example, the above
proof does not explicitly give us two Texans with the same number of hairs
on their heads; it only shows that two such people exist. If we were to make
a constructive proof, we could find examples of two bald Texans. Then they
have the same number of head hairs, namely zero.


Exercises for Section 12.3 


1. Prove that if six integers are chosen at random, then at least two of them will
have the same remainder when divided by 5.
2. Prove that if a is a natural number, then there exist two unequal natural numbers
k and ℓ for which aᵏ − a^ℓ is divisible by 10.
3. Prove that for any six integers, 9 divides the sum or difference of two of them.
4. Consider a square whose side-length is one unit. Select any five points from
inside this square. Prove that at least two of these points are within √2/2 units of
each other.
5. Prove that any set of seven distinct integers contains a pair of integers whose
sum or difference is divisible by 10.
6. Given a sphere S, a great circle of S is the intersection of S with a plane through
its center. Every great circle divides S into two parts. A hemisphere is the union
of the great circle and one of these two parts. Prove that if five points are placed
arbitrarily on S, then there is a hemisphere that contains four of them.
7. Prove or disprove: Any subset X ⊆ {1,2,3,...,2n} with |X| > n contains two
(unequal) elements for which one divides the other.


12.4 Composition 


You are familiar with the notion of function composition from algebra and 
calculus. Still, it is worthwhile to revisit it now with our more sophisticated 
ideas about functions. 


Definition 12.5 Suppose f : A → B and g : B → C are functions with the
property that the codomain of f equals the domain of g. The composition
of f with g is another function, denoted as g∘f and defined as follows: If
x ∈ A, then g∘f(x) = g(f(x)). Therefore g∘f sends elements of A to elements
of C, so g∘f : A → C.


The following figure illustrates the definition. Here f : A → B, g : B → C,
and g∘f : A → C. We have, for example, g∘f(0) = g(f(0)) = g(2) = 4. Be very
careful with the order of the symbols. Even though g comes first in the
symbol g∘f, we work out g∘f(x) as g(f(x)), with f acting on x first, followed
by g acting on f(x).


[Figure: arrow diagrams of f : A → B and g : B → C, together with the
composition g∘f : A → C.]

Figure 12.5. Composition of two functions


Notice that the composition g∘f also makes sense if the range of f
is a subset of the domain of g. You should take note of this fact, but to
keep matters simple we will continue to emphasize situations where the
codomain of f equals the domain of g.


Example 12.8 Suppose A = {a,b,c}, B = {0,1}, C = {1,2,3}. Let f : A → B
be the function f = {(a,0),(b,1),(c,0)}, and let g : B → C be g = {(0,3),(1,1)}.
Then g∘f = {(a,3),(b,1),(c,3)}.
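
For finite functions such as these, composition amounts to following one lookup with another. The sketch below stores f and g as Python dicts, taking g(0) = 3 and g(1) = 1, the values consistent with the stated result; the helper name compose is hypothetical.

```python
f = {"a": 0, "b": 1, "c": 0}   # f : A -> B
g = {0: 3, 1: 1}               # g : B -> C

def compose(g, f):
    """Compose finite functions stored as dicts: (g o f)(x) = g(f(x))."""
    return {x: g[y] for x, y in f.items()}

g_of_f = compose(g, f)
print(g_of_f)  # {'a': 3, 'b': 1, 'c': 3}
```

The lookup g[y] raises a KeyError if some value of f is missing from the domain of g, which is the dict analogue of the composition being undefined.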


Example 12.9 Say A = {a,b,c}, B = {0,1}, C = {1,2,3}. Let f : A → B
be the function f = {(a,0),(b,1),(c,0)}, and let g : C → B be the function
g = {(1,0),(2,1),(3,1)}. In this situation the composition g∘f is not defined
because the codomain B of f is not the same set as the domain C of g.
Remember: In order for g∘f to make sense, the codomain of f must equal
the domain of g. (Or at least be a subset of it.)


Example 12.10 Let f : R → R be defined as f(x) = x² + x, and g : R → R be
defined as g(x) = x + 1. Then g∘f : R → R is the function defined by the formula
g∘f(x) = g(f(x)) = g(x² + x) = x² + x + 1.

Since the domains and codomains of g and f are the same, we can in
this case do a composition in the other order. Note that f∘g : R → R is the
function defined as f∘g(x) = f(g(x)) = f(x+1) = (x+1)² + (x+1) = x² + 3x + 2.

This example illustrates that even when g∘f and f∘g are both defined,
they are not necessarily equal. We can express this fact by saying function
composition is not commutative.
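
The two orders of composition are easy to compare numerically. This sketch takes f(x) = x² + x and g(x) = x + 1 as in Example 12.10; the helper compose is a hypothetical name introduced for illustration.

```python
def compose(g, f):
    """Return the function g o f, which applies f first and then g."""
    return lambda x: g(f(x))

f = lambda x: x**2 + x
g = lambda x: x + 1

g_of_f = compose(g, f)   # x -> x^2 + x + 1
f_of_g = compose(f, g)   # x -> x^2 + 3x + 2

print(g_of_f(2), f_of_g(2))  # 7 12
```

A single input where the two values differ, such as x = 2, is enough to show that g∘f ≠ f∘g.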




We close this section by proving several facts about composition that you 
are likely to encounter in your future study of mathematics. First, we note 
that, although it is not commutative, function composition is associative. 


Theorem 12.1 Composition of functions is associative. That is, if f : A → B,
g : B → C and h : C → D, then (h∘g)∘f = h∘(g∘f).


Proof. Suppose f, g, h are as stated. It follows from Definition 12.5 that
both (h∘g)∘f and h∘(g∘f) are functions from A to D. To show that they
are equal, we just need to show

((h∘g)∘f)(x) = (h∘(g∘f))(x)

for every x ∈ A. Note that Definition 12.5 yields

((h∘g)∘f)(x) = (h∘g)(f(x)) = h(g(f(x))).

Also

(h∘(g∘f))(x) = h((g∘f)(x)) = h(g(f(x))).

Thus

((h∘g)∘f)(x) = (h∘(g∘f))(x),

as both sides equal h(g(f(x))). ∎


Theorem 12.2 Suppose f : A → B and g : B → C. If both f and g are
injective, then g∘f is injective. If both f and g are surjective, then g∘f is
surjective.


Proof. First suppose both f and g are injective. To see that g∘f is injective,
we must show that g∘f(x) = g∘f(y) implies x = y. Suppose g∘f(x) = g∘f(y).
This means g(f(x)) = g(f(y)). It follows that f(x) = f(y). (For otherwise g
wouldn’t be injective.) But since f(x) = f(y) and f is injective, it must be
that x = y. Therefore g∘f is injective.

Next suppose both f and g are surjective. To see that g∘f is surjective,
we must show that for any element c ∈ C, there is a corresponding element
a ∈ A for which g∘f(a) = c. Thus consider an arbitrary c ∈ C. Because g
is surjective, there is an element b ∈ B for which g(b) = c. And because
f is surjective, there is an element a ∈ A for which f(a) = b. Therefore
g(f(a)) = g(b) = c, which means g∘f(a) = c. Thus g∘f is surjective. ∎




Exercises for Section 12.4 


1. Suppose A = {5,6,8}, B = {0,1}, C = {1,2,3}. Let f : A → B be the function f =
{(5,1),(6,0),(8,1)}, and g : B → C be g = {(0,1),(1,1)}. Find g∘f.

2. Suppose A = {1,2,3,4}, B = {0,1,2}, C = {1,2,3}. Let f : A → B be f = {(1,0),(2,1),
(3,2),(4,0)}, and g : B → C be g = {(0,1),(1,1),(2,3)}. Find g∘f.

3. Suppose A = {1,2,3}. Let f : A → A be the function f = {(1,2),(2,2),(3,1)}, and let
g : A → A be the function g = {(1,3),(2,1),(3,2)}. Find g∘f and f∘g.

4. Suppose A = {a,b,c}. Let f : A → A be the function f = {(a,c),(b,c),(c,c)}, and let
g : A → A be the function g = {(a,a),(b,b),(c,a)}. Find g∘f and f∘g.

5. Consider the functions f, g : R → R defined as f(x) = ∛(x+1) and g(x) = x³. Find the
formulas for g∘f and f∘g.

6. Consider the functions f, g : R → R defined as f(x) = 1/(x² + 1) and g(x) = 3x + 2. Find
the formulas for g∘f and f∘g.

7. Consider the functions f, g : Z × Z → Z × Z defined as f(m,n) = (mn, m²) and
g(m,n) = (m+1, m+n). Find the formulas for g∘f and f∘g.

8. Consider the functions f, g : Z × Z → Z × Z defined as f(m,n) = (3m − 4n, 2m + n) and
g(m,n) = (5m + n, m). Find the formulas for g∘f and f∘g.

9. Consider the functions f : Z × Z → Z defined as f(m,n) = m + n and g : Z → Z × Z
defined as g(m) = (m,m). Find the formulas for g∘f and f∘g.

10. Consider the function f : R² → R² defined by the formula f(x,y) = (xy, x³). Find a
formula for f∘f.


12.5 Inverse Functions 


You may recall from calculus that if a function f is injective and surjective,
then it has an inverse function f⁻¹ that “undoes” the effect of f in the sense
that f⁻¹(f(x)) = x for every x in the domain. (For example, if f(x) = x³, then
f⁻¹(x) = ∛x.) We now review these ideas. Our approach uses two ingredients,
outlined in the following definitions.


Definition 12.6 For a set A, the identity function on A is the function
i_A : A → A defined as i_A(x) = x for every x ∈ A.


For example, if A = {1,2,3}, then i_A = {(1,1),(2,2),(3,3)}. Also i_Z = {(n,n) : n ∈ Z}.
The identity function on a set is the function that sends any element of the 
set to itself. 

Notice that for any set A, the identity function i_A is bijective: It is
injective because i_A(x) = i_A(y) immediately reduces to x = y. It is surjective
because if we take any element b in the codomain A, then b is also in the
domain A, and i_A(b) = b.


Definition 12.7 Given a relation R from A to B, the inverse relation
of R is the relation from B to A defined as R⁻¹ = {(y,x) : (x,y) ∈ R}. In other
words, the inverse of R is the relation R⁻¹ obtained by interchanging the
elements in every ordered pair in R.


For example, let A = {a,b,c} and B = {1,2,3}, and suppose f is the relation
f = {(a,2),(b,3),(c,1)} from A to B. Then f⁻¹ = {(2,a),(3,b),(1,c)} and this is a
relation from B to A. Notice that f is actually a function from A to B, and
f⁻¹ is a function from B to A. These two relations are drawn below. Notice
the drawing for relation f⁻¹ is just the drawing for f with arrows reversed.


[Figure: arrow diagrams between A and B for the relations
f = {(a,2),(b,3),(c,1)} and f⁻¹ = {(2,a),(3,b),(1,c)}.]


For another example, let A and B be the same sets as above, but consider 
the relation g = {(a,2),(b,3),(c,3)} from A to B. Then g⁻¹ = {(2,a),(3,b),(3,c)}
is a relation from B to A. These two relations are sketched below. 


[Figure: arrow diagrams between A and B for the relations
g = {(a,2),(b,3),(c,3)} and g⁻¹ = {(2,a),(3,b),(3,c)}.]


This time, even though the relation g is a function, its inverse g⁻¹ is not
a function because the element 3 occurs twice as a first coordinate of an
ordered pair in g⁻¹.

In the above examples, relations f and g are both functions, and f⁻¹ is a
function and g⁻¹ is not. This raises a question: What properties does f have
and g lack that makes f⁻¹ a function and g⁻¹ not a function? The answer
is not hard to see. Function g is not injective because g(b) = g(c) = 3, and
thus (b,3) and (c,3) are both in g. This causes a problem with g⁻¹ because
it means (3,b) and (3,c) are both in g⁻¹, so g⁻¹ can’t be a function. Thus, in
order for g⁻¹ to be a function, it would be necessary that g be injective.




But that is not enough. Function g also fails to be surjective because
no element of A is sent to the element 1 ∈ B. This means g⁻¹ contains no
ordered pair whose first coordinate is 1, so it can’t be a function from B to
A. If g⁻¹ were to be a function it would be necessary that g be surjective.

The previous two paragraphs suggest that if g is a function, then it must
be bijective in order for its inverse relation g⁻¹ to be a function. Indeed,
this is easy to verify. Conversely, if a function is bijective, then its inverse
relation is easily seen to be a function. We summarize this in the following
theorem.


Theorem 12.3 Let f : A → B be a function. Then f is bijective if and only
if the inverse relation f⁻¹ is a function from B to A.
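
On finite sets the theorem can be watched in action. The sketch below stores relations as sets of ordered pairs and tests whether an inverse relation is a function; the relations f and g are the ones from the two examples above, and the helper names inverse_relation and is_function are hypothetical.

```python
def inverse_relation(R):
    """Swap the coordinates of every ordered pair in R."""
    return {(y, x) for (x, y) in R}

def is_function(R, A, B):
    """True if R is a function from A to B, i.e. R is a subset of A x B
    and every element of A occurs exactly once as a first coordinate."""
    firsts = [x for (x, _) in R]
    return (all(x in A and y in B for (x, y) in R)
            and sorted(firsts) == sorted(A))

A, B = {"a", "b", "c"}, {1, 2, 3}
f = {("a", 2), ("b", 3), ("c", 1)}   # bijective
g = {("a", 2), ("b", 3), ("c", 3)}   # neither injective nor surjective

print(is_function(inverse_relation(f), B, A))  # True
print(is_function(inverse_relation(g), B, A))  # False
```

The inverse of g fails the test for both reasons discussed above: 3 occurs twice as a first coordinate, and 1 does not occur at all.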


Suppose f : A → B is bijective, so according to the theorem f⁻¹ is a
function. Observe that the relation f contains all the pairs (x, f(x)) for x ∈ A,
so f⁻¹ contains all the pairs (f(x), x). But (f(x), x) ∈ f⁻¹ means f⁻¹(f(x)) = x.
Therefore f⁻¹∘f(x) = x for every x ∈ A. From this we get f⁻¹∘f = i_A. Similar
reasoning produces f∘f⁻¹ = i_B. This leads to the following definitions.


Definition 12.8 If f : A → B is bijective then its inverse is the function
f⁻¹ : B → A. The functions f and f⁻¹ obey the equations f⁻¹∘f = i_A and
f∘f⁻¹ = i_B.


You probably recall from algebra at least one technique for computing 
the inverse of a bijective function f. To find f⁻¹, start with the equation
y = f(x). Then interchange variables to get x = f(y). Solving this equation
for y (if possible) produces y = f⁻¹(x). The next two examples illustrate this.


Example 12.11 The function f : R → R defined as f(x) = x³ + 1 is bijective.
Find its inverse.

We begin by writing y = x³ + 1. Now interchange variables to obtain
x = y³ + 1. Solving for y produces y = ∛(x − 1). Thus

f⁻¹(x) = ∛(x − 1).

(You can check your answer by computing

f⁻¹(f(x)) = ∛(f(x) − 1) = ∛(x³ + 1 − 1) = x.

Therefore f⁻¹(f(x)) = x. Any answer other than x indicates a mistake.)




Example 12.12 Example 12.6 showed that the function g : Z × Z → Z × Z
defined by the formula g(m,n) = (m+n, m+2n) is bijective. Find its inverse.

The approach outlined above should work, but we need to be careful to
keep track of coordinates in Z × Z. We begin by writing (x,y) = g(m,n), then
interchanging the variables (x,y) and (m,n) to get (m,n) = g(x,y). This gives

(m,n) = (x+y, x+2y),

from which we get the following system of equations:

x + y = m
x + 2y = n.

Solving this system, we get

x = 2m − n
y = n − m.

Then (x,y) = (2m − n, n − m), so g⁻¹(m,n) = (2m − n, n − m).

We can check this by confirming g⁻¹(g(m,n)) = (m,n). Doing the math,

g⁻¹(g(m,n)) = g⁻¹(m+n, m+2n)
= (2(m+n) − (m+2n), (m+2n) − (m+n))
= (m,n).
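
The same check can be run by machine on a grid of integer points, in both orders of composition. This is a minimal sketch assuming g(m,n) = (m+n, m+2n) and the inverse formula (2m − n, n − m) found above; the name g_inv and the window size are arbitrary choices.

```python
def g(m, n):
    return (m + n, m + 2 * n)

def g_inv(m, n):
    """Candidate inverse of g found by solving the linear system."""
    return (2 * m - n, n - m)

window = range(-5, 6)
# g_inv after g should be the identity, and so should g after g_inv.
left_ok = all(g_inv(*g(m, n)) == (m, n) for m in window for n in window)
right_ok = all(g(*g_inv(m, n)) == (m, n) for m in window for n in window)
print(left_ok, right_ok)
```

The two tests correspond to the equations g⁻¹∘g = i and g∘g⁻¹ = i on Z × Z, restricted here to a finite window.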


Exercises for Section 12.5 


1. Check that f : Z → Z defined by f(n) = 6 − n is bijective. Then compute f⁻¹.
2. In Exercise 9 of Section 12.2 you proved that f : R − {2} → R − {5} defined by
f(x) = (5x+1)/(x−2) is bijective. Now find its inverse.
3. Let B = {2ⁿ : n ∈ Z} = {..., 1/4, 1/2, 1, 2, 4, 8, ...}. Show that the function f : Z → B defined
as f(n) = 2ⁿ is bijective. Then find f⁻¹.
4. The function f : R → (0,∞) defined as f(x) = e^(x³+1) is bijective. Find its inverse.
5. The function f : R → R defined as f(x) = πx − e is bijective. Find its inverse.
6. The function f : Z × Z → Z × Z defined by the formula f(m,n) = (5m + 4n, 4m + 3n)
is bijective. Find its inverse.
7. Show that the function f : R² → R² defined by the formula f(x,y) = ((x² + 1)y, x³) is
bijective. Then find its inverse.
8. Is the function θ : 𝒫(Z) → 𝒫(Z) defined as θ(X) = X̄ bijective? If so, find θ⁻¹.
9. Consider the function f : R × N → N × R defined as f(x,y) = (y, 3xy). Check that this
is bijective; find its inverse.
10. Consider f : N → Z defined as f(n) = ((−1)ⁿ(2n−1) + 1)/4. This function is bijective by
Exercise 18 in Section 12.2. Find its inverse.




12.6 Image and Preimage 


It is time to take up a matter of notation that you will encounter in future 
mathematics classes. Suppose we have a function f : A → B. If X ⊆ A, the
expression f(X) has a special meaning. It stands for the set {f(x) : x ∈ X}.
And if Y ⊆ B, then f⁻¹(Y) has a meaning even if f is not invertible: it stands
for the set {x ∈ A : f(x) ∈ Y}. Here are the precise definitions.


Definition 12.9 Suppose f : A → B is a function.
1. If X ⊆ A, the image of X is the set f(X) = {f(x) : x ∈ X} ⊆ B.
2. If Y ⊆ B, the preimage of Y is the set f⁻¹(Y) = {x ∈ A : f(x) ∈ Y} ⊆ A.


In words, the image f(X) of X is the set of all things in B that f sends 
elements of X to. (Roughly speaking, you might think of f(X) as a kind of 
distorted “copy” or “image” of X in B.) The preimage f⁻¹(Y) of Y is the set
of all things in A that f sends into Y. 

Maybe you have already encountered these ideas in linear algebra, in 
a setting involving a linear transformation T : V → W between two vector
spaces. If X ⊆ V is a subspace of V, then its image T(X) is a subspace of W.
If Y ⊆ W is a subspace of W, then its preimage T⁻¹(Y) is a subspace of V.
(If this does not sound familiar, then ignore it.) 


Example 12.13 Let f : {s,t,u,v,w,x,y,z} → {0,1,2,3,4,5,6,7,8,9} be
f = {(s,4),(t,8),(u,8),(v,1),(w,2),(x,4),(y,6),(z,4)}.


This f is neither injective nor surjective, so it certainly is not invertible. Be 
sure you understand the following statements. 


1. f({s,t,u,z}) = {8,4}
2. f({s,x,z}) = {4}
3. f({s,v,w,y}) = {1,2,4,6}
4. f(∅) = ∅
5. f⁻¹({4}) = {s,x,z}
6. f⁻¹({4,9}) = {s,x,z}
7. f⁻¹({9}) = ∅
8. f⁻¹({1,4,8}) = {s,t,u,v,x,z}


It is important to realize that the X and Y in Definition 12.9 are subsets 
(not elements!) of A and B. In Example 12.13 we had f⁻¹({4}) = {s,x,z},
while f⁻¹(4) is meaningless because the inverse function f⁻¹ does not exist.
And there is a subtle difference between f({s}) = {4} and f(s) = 4. Be careful.
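
Images and preimages of finite sets translate directly into set comprehensions. The following sketch stores the function f of Example 12.13 as a Python dict; the helper names image and preimage are hypothetical.

```python
f = {"s": 4, "t": 8, "u": 8, "v": 1, "w": 2, "x": 4, "y": 6, "z": 4}

def image(f, X):
    """f(X) = {f(x) : x in X}."""
    return {f[x] for x in X}

def preimage(f, Y):
    """The set {x in A : f(x) in Y}; defined even if f is not invertible."""
    return {x for x in f if f[x] in Y}

print(image(f, {"s", "t", "u", "z"}) == {4, 8})   # True
print(preimage(f, {4}) == {"s", "x", "z"})        # True
print(preimage(f, {9}) == set())                  # True
```

Both helpers take sets as arguments and return sets, which reflects the point that X and Y are subsets, not elements.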


Example 12.14 Consider the function f : R → R defined as f(x) = x². Note
that f({0,1,2}) = {0,1,4} and f⁻¹({0,1,4}) = {−2,−1,0,1,2}. This shows that
f⁻¹(f(X)) ≠ X in general.




Using the same f, check your understanding of these statements about 
images and preimages of intervals: f([−2,3]) = [0,9], and f⁻¹([0,9]) = [−3,3].
Also f(R) = [0,∞) and f⁻¹([−2,−1]) = ∅.

If you continue with mathematics you will likely encounter the following 
results. For now, you are asked to prove them in the exercises. 


Theorem 12.4 Given f : A → B, let W, X ⊆ A, and Y, Z ⊆ B. Then

1. f(W ∩ X) ⊆ f(W) ∩ f(X)
2. f(W ∪ X) = f(W) ∪ f(X)
3. X ⊆ f⁻¹(f(X))
4. f⁻¹(Y ∪ Z) = f⁻¹(Y) ∪ f⁻¹(Z)
5. f⁻¹(Y ∩ Z) = f⁻¹(Y) ∩ f⁻¹(Z)
6. f(f⁻¹(Y)) ⊆ Y.


Exercises for Section 12.6 


1. Consider the function f : R → R defined as f(x) = x² + 3. Find f([−3,5]) and
f⁻¹([12,19]).
2. Consider the function f : {1,2,3,4,5,6,7} → {0,1,2,3,4,5,6,7,8,9} given as

f = {(1,3),(2,8),(3,3),(4,1),(5,2),(6,4),(7,6)}.

Find: f({1,2,3}), f({4,5,6,7}), f(∅), f⁻¹({0,5,9}) and f⁻¹({0,3,5,9}).
3. This problem concerns functions f : {1,2,3,4,5,6,7} → {0,1,2,3,4}. How many
such functions have the property that |f⁻¹({3})| = 3?
4. This problem concerns functions f : {1,2,3,4,5,6,7,8} → {0,1,2,3,4,5,6}. How
many such functions have the property that |f⁻¹({2})| = 4?
5. Consider a function f : A → B and a subset X ⊆ A. We observed in Example 12.14
that f⁻¹(f(X)) ≠ X in general. However X ⊆ f⁻¹(f(X)) is always true. Prove this.
6. Given a function f : A → B and a subset Y ⊆ B, is f(f⁻¹(Y)) = Y always true?
Prove or give a counterexample.
7. Given a function f : A → B and subsets W, X ⊆ A, prove f(W ∩ X) ⊆ f(W) ∩ f(X).
8. Given a function f : A → B and subsets W, X ⊆ A, then f(W ∩ X) = f(W) ∩ f(X) is
false in general. Produce a counterexample.
9. Given a function f : A → B and subsets W, X ⊆ A, prove f(W ∪ X) = f(W) ∪ f(X).
10. Given f : A → B and subsets Y, Z ⊆ B, prove f⁻¹(Y ∩ Z) = f⁻¹(Y) ∩ f⁻¹(Z).
11. Given f : A → B and subsets Y, Z ⊆ B, prove f⁻¹(Y ∪ Z) = f⁻¹(Y) ∪ f⁻¹(Z).
12. Consider f : A → B. Prove that f is injective if and only if X = f⁻¹(f(X)) for all
X ⊆ A. Prove that f is surjective if and only if f(f⁻¹(Y)) = Y for all Y ⊆ B.
13. Let f : A → B be a function, and X ⊆ A. Prove or disprove: f(f⁻¹(f(X))) = f(X).
14. Let f : A → B be a function, and Y ⊆ B. Prove or disprove: f⁻¹(f(f⁻¹(Y))) = f⁻¹(Y).




CHAPTER 13 


Proofs in Calculus 


The proofs we have dealt with so far in this text have been largely proofs
about integers, or about structures related to integers (divisibility,
congruence modulo n, sets of integers, relations among integers, functions
of integers, etc.).

Of course mathematics is not restricted to just integers. Calculus is 
built on the system of real numbers R. Thus the main definitions in calculus 
cater to R. Consequently the proofs in calculus (which use the definitions) 
have a distinct flavor that is quite different from proofs in other areas of 
mathematics. In reading and writing proofs in calculus you will still use the 
main proof techniques (direct, contrapositive, contradiction), but it can take 
some time to adjust your thinking to the idiosyncrasies of R. This chapter 
is intended to ease that adjustment. It is an introduction to some of the 
ideas you will encounter in later courses in advanced calculus (also called 
analysis). This chapter is not needed for the remainder of the text, so it can 
be skipped without a loss of continuity. 

Single-variable calculus (the first two semesters of a standard calculus 
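sequence) deals with functions $f : \mathbb{R} \to \mathbb{R}$, or more generally $f : X \to \mathbb{R}$ for
$X \subseteq \mathbb{R}$. Usually the domain X is an interval or a union of intervals. For
example, a rational function whose denominator vanishes exactly at $x = 1$ and $x = 2$
is a function $f : (-\infty,1) \cup (1,2) \cup (2,\infty) \to \mathbb{R}$, whereas
$f(x) = \sqrt{x}$ has domain $X = [0,\infty)$, and $f(x) = x^2 - x$ has domain $X = (-\infty,\infty) = \mathbb{R}$.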

Calculus rests on the idea of a limit, and it is the limit that separates 
calculus from algebra and trigonometry. We will study limits in sections 13.2 
through 13.6. It is assumed that you have had a prior course in calculus 
and already have some experience with limits. But our present treatment is 
more theoretical. It serves the double purpose of putting your earlier work 
on a firmer foundation while preparing you for more advanced studies. 

Another calculus topic (typically from the second semester of a standard 
course) concerns sequences and series, where functions $f : \mathbb{N} \to \mathbb{R}$ play a
major role. We will turn to this in sections 13.7 and 13.8. 

All of this requires a result called the triangle inequality, so we begin 
there. 




13.1 The Triangle Inequality 
Definitions in calculus and analysis use absolute value extensively. As you 
know, the absolute value of a real number x is the non-negative number 


$$|x| = \begin{cases} x & \text{if } x \ge 0 \\ -x & \text{if } x < 0. \end{cases}$$

Fundamental properties of absolute value include $|xy| = |x| \cdot |y|$ and $x \le |x|$.
Another property, used often in proofs, is the triangle inequality:

Theorem 13.1 (Triangle inequality) If $x, y, z \in \mathbb{R}$, then $|x-y| \le |x-z| + |z-y|$.


Proof. The name triangle inequality comes from the fact that the theorem 
can be interpreted as asserting that for any “triangle” on the number line, 
the length of any side never exceeds the sum of the lengths of the other two 
sides. Indeed, the distance between any two numbers $a, b \in \mathbb{R}$ is $|a-b|$. With
this in mind, observe in the diagrams below that regardless of the order of
x, y, z on the number line, the inequality $|x-y| \le |x-z| + |z-y|$ holds.

[Diagrams: the six possible orderings of x, y, z on the number line, each marked with the distances $|x-y|$, $|x-z|$ and $|z-y|$.]

(These diagrams show x, y, z as distinct points. If x = y, x = z or y = z, then
$|x-y| \le |x-z| + |z-y|$ holds automatically.) ■


The triangle inequality says the shortest route from x to y avoids z unless 
z lies between x and y. Several useful results flow from it. Put z =0 to get 


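$$|x-y| \le |x| + |y| \quad \text{for any } x, y \in \mathbb{R}. \tag{13.1}$$

Replacing the y in this inequality with $-y$ gives

$$|x+y| \le |x| + |y| \quad \text{for any } x, y \in \mathbb{R}. \tag{13.2}$$

Also, by the triangle inequality, $|x-0| \le |x-(-y)| + |-y-0|$, which yields

$$|x| - |y| \le |x+y| \quad \text{for any } x, y \in \mathbb{R}. \tag{13.3}$$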


The three inequalities (13.1), (13.2) and (13.3) are very useful in proofs. 
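As a sanity check (not a proof, and not part of the original text), the sketch below tests Theorem 13.1 and the inequalities (13.1), (13.2) and (13.3) on randomly chosen real numbers. The function name, the sampling range and the small tolerance for floating-point rounding are our own assumptions.

```python
import random

def check_triangle_inequalities(trials=10_000, seed=0):
    """Spot-check Theorem 13.1 and (13.1)-(13.3) on random reals."""
    rng = random.Random(seed)
    tol = 1e-9  # allowance for floating-point rounding
    for _ in range(trials):
        x, y, z = (rng.uniform(-100, 100) for _ in range(3))
        assert abs(x - y) <= abs(x - z) + abs(z - y) + tol  # Theorem 13.1
        assert abs(x - y) <= abs(x) + abs(y) + tol           # (13.1)
        assert abs(x + y) <= abs(x) + abs(y) + tol           # (13.2)
        assert abs(x) - abs(y) <= abs(x + y) + tol           # (13.3)
    return True

print(check_triangle_inequalities())
```

A passing run only says that no counterexample was found among the samples; the proof above is what establishes the inequalities for all real numbers.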




13.2 Definition of a Limit 


Limits are designed to deal with the following type of problem: We need to 
know how a certain function f(x) behaves when x is close to some number c. 
Perhaps f(c) is not even defined, so the graph of f looks something like what 
is shown below; a curve with a hole at a point (c,L). 


[Figure: two copies of the graph of y = f(x), a curve with a hole at the point (c, L). The right-hand copy shows f(x) approaching L as x approaches c.]

In this picture, for any $x \neq c$, the corresponding value f(x) is either greater
than L or less than L. But the closer x is to c, the closer f(x) is to L, as
illustrated on the right. We express this as $\lim_{x \to c} f(x) = L$. That is, the symbols
$\lim_{x \to c} f(x)$ stand for the number that f(x) approaches as x approaches c.

Your calculus text probably presented an informal, intuitive definition 
of a limit that likely went something like this. 


Definition 13.1 (Informal definition of a limit) 
Suppose f is a function and c is a number. Then $\lim_{x \to c} f(x) = L$ means that
f(x) is arbitrarily close to L provided that x is sufficiently close to c. 


The idea is that no matter how close we want to make f(x) to L, we can 
be assured that it will be that close (or closer) if x is close enough to c. 

Definition 13.1 is sufficient for the first few semesters of calculus, but it 
is not adequate for deeper, more rigorous investigations. The problem is 
that it is too vague. What, exactly, is meant by close? Saying x is “close” to c
is not much better than saying that an integer n is “sort of even.” No proof 
can be done in the presence of such ambiguity. 

So this section’s first task is to motivate and develop a more rigorous and 
precise limit definition, the one used in advanced calculus. Achieving this 
goal forces us to grapple with the imprecise term close. What do we mean 
by close? Within 0.1 units? Within 0.001 or 0.00001 units, or even closer? 
We will make the definition precise by introducing a numeric, quantitative 
measure of closeness. 




Standard practice uses the Greek letters $\varepsilon$ (epsilon) and $\delta$ (delta) for
variables representing how close f(x) is to L, and x is to c. For instance,
x is within a distance of $\delta$ from c if and only if $c - \delta < x < c + \delta$, that is,
$-\delta < x - c < \delta$, or $|x - c| < \delta$. So for any real number $\delta > 0$ (no matter how
small) the statement $|x - c| < \delta$ means that x is within $\delta$ units from c.

[Figure: number line with x lying between $c - \delta$ and $c + \delta$, within $\delta$ units of c.]

Likewise $|f(x) - L| < \varepsilon$ means that f(x) is within $\varepsilon$ units from L. Let’s apply
these ideas to Definition 13.1, and transform it line by line.

Informal definition  ⟶  Precise definition

$\lim_{x \to c} f(x) = L$ means that  ⟶  $\lim_{x \to c} f(x) = L$ means that
f(x) is arbitrarily close to L  ⟶  for any $\varepsilon > 0$, $|f(x) - L| < \varepsilon$
provided that  ⟶  provided that
x is sufficiently close to c  ⟶  $0 < |x - c| < \delta$ for some $\delta > 0$.


We have arrived at a precise definition of a limit. 


Definition 13.2 (Precise definition of a limit)

Suppose $f : X \to \mathbb{R}$ is a function, where $X \subseteq \mathbb{R}$, and $c \in \mathbb{R}$. Then $\lim_{x \to c} f(x) = L$
means that for any real $\varepsilon > 0$ (no matter how small), there is a real number
$\delta > 0$ for which $|f(x) - L| < \varepsilon$ provided that $0 < |x - c| < \delta$.

Figure 13.1 illustrates this. For any $\varepsilon > 0$, no matter how small, consider
the narrow shaded band of points on the plane whose y-coordinates are
between $y = L - \varepsilon$ and $y = L + \varepsilon$. Given this $\varepsilon$, we can find another number
$\delta > 0$ such that the point (x, f(x)) is in the shaded band whenever $x \neq c$ is within
$\delta$ units from c. In other words, $|f(x) - L| < \varepsilon$ provided that $0 < |x - c| < \delta$.


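[Graph of y = f(x), with the band between $y = L - \varepsilon$ and $y = L + \varepsilon$ shaded and the interval from $c - \delta$ to $c + \delta$ marked on the x-axis.]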


Figure 13.1. A graphic description of the limit definition. 




Three comments are in order. First, we have slipped into Definition 13.2
the expression $0 < |x - c| < \delta$ instead of $|x - c| < \delta$. This is to rule out the
possibility x = c, as f(c) may not be defined, depending on f and c.

Second, Definition 13.2 applies only if there is some $\delta > 0$ for which
$(c - \delta, c) \cup (c, c + \delta)$ is a subset of the domain of f. Otherwise the statement
“$|f(x) - L| < \varepsilon$ provided that $0 < |x - c| < \delta$” is meaningless for some x, no matter
how small $\delta$ is. Thus $\lim_{x \to c} f(x)$ makes sense only if f(x) is defined for all $x \in \mathbb{R}$
that are “close to” c in the sense that $x \in (c - \delta, c) \cup (c, c + \delta)$ for some $\delta > 0$.

Third, in symbolic form Definition 13.2 says $\lim_{x \to c} f(x) = L$ if and only if

$$\forall \varepsilon > 0, \ \exists \delta > 0, \ (0 < |x - c| < \delta) \Rightarrow (|f(x) - L| < \varepsilon). \tag{13.4}$$

Thus proving $\lim_{x \to c} f(x) = L$ amounts to proving that Statement (13.4) is true.
One strategy for proving Statement (13.4) is the direct approach. Begin
by assuming $\varepsilon > 0$. Then find a $\delta > 0$ for which $(0 < |x - c| < \delta) \Rightarrow (|f(x) - L| < \varepsilon)$.
To find $\delta$, try to extract a factor of $|x - c|$ from $|f(x) - L|$. If you can do this,
inspection usually tells you how small $|x - c|$ needs to be to make $|f(x) - L| < \varepsilon$.
We will use this strategy in Example 13.1, which proves $\lim_{x \to 2} (3x + 4) = 10$.


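Here $f(x) = 3x + 4$ and $L = 10$, so $|f(x) - L|$ is $|(3x + 4) - 10|$. Also $|x - c|$ is $|x - 2|$.

Example 13.1 Prove that $\lim_{x \to 2} (3x + 4) = 10$.

Proof. Suppose $\varepsilon > 0$. Note that $|(3x + 4) - 10| = |3x - 6| = |3(x - 2)| = 3|x - 2|$.
So if $\delta = \frac{\varepsilon}{3}$, then $0 < |x - 2| < \delta$ yields $|(3x + 4) - 10| = 3|x - 2| < 3\delta = 3 \cdot \frac{\varepsilon}{3} = \varepsilon$.

In summary, for any $\varepsilon > 0$, there is a $\delta = \frac{\varepsilon}{3}$ for which $0 < |x - 2| < \delta$ implies
$|(3x + 4) - 10| < \varepsilon$. By Definition 13.2, $\lim_{x \to 2} (3x + 4) = 10$. ■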
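The choice $\delta = \varepsilon/3$ can be spot-checked numerically. The sketch below is our own illustration, not part of the original text: the helper `holds_on_samples` and its sampling scheme are assumptions, and a passing check at finitely many points is evidence only, never a substitute for the proof.

```python
def delta_for_example_13_1(eps):
    """The delta chosen in the proof of Example 13.1: delta = eps / 3."""
    return eps / 3

def holds_on_samples(f, c, L, eps, delta, samples=1000):
    """Test |f(x) - L| < eps at sample points x with 0 < |x - c| < delta.

    This is only a numerical spot check, not a proof.
    """
    for k in range(1, samples + 1):
        offset = delta * k / (samples + 1)  # strictly between 0 and delta
        for x in (c - offset, c + offset):
            if not abs(f(x) - L) < eps:
                return False
    return True

f = lambda x: 3 * x + 4
for eps in (1.0, 0.1, 0.001):
    print(eps, holds_on_samples(f, 2, 10, eps, delta_for_example_13_1(eps)))
```

Trying a $\delta$ that is too large for a given $\varepsilon$ (say $\delta = 0.2$ with $\varepsilon = 0.3$) makes the check fail, which shows why $\delta$ must depend on $\varepsilon$.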


Example 13.2 Prove that $\lim_{x \to 2} 5x^2 = 20$.

Proof. Suppose $\varepsilon > 0$. Notice that

$$|f(x) - L| = |5x^2 - 20| = |5(x^2 - 4)| = |5(x - 2)(x + 2)| = 5 \cdot |x - 2| \cdot |x + 2|.$$

Now we have a factor of $|x - 2|$ in $|f(x) - L|$, but it is accompanied by $|x + 2|$.
But if $|x - 2|$ is small, then x is close to 2, so $|x + 2|$ should be close to 4. In
fact, if $|x - 2| \le 1$, then $|x + 2| = |(x - 2) + 4| \le |x - 2| + |4| \le 1 + 4 = 5$. (Here we
applied the inequality (13.2) from Section 13.1.) In other words, if $|x - 2| \le 1$,
then $|x + 2| \le 5$, and the above equation yields

$$|f(x) - L| = |5x^2 - 20| = 5 \cdot |x - 2| \cdot |x + 2| \le 5 \cdot |x - 2| \cdot 5 = 25|x - 2|.$$

Take $\delta$ to be smaller than both 1 and $\frac{\varepsilon}{25}$. Then $0 < |x - 2| < \delta$ implies
$|5x^2 - 20| \le 25 \cdot |x - 2| < 25\delta < 25 \cdot \frac{\varepsilon}{25} = \varepsilon$. By Definition 13.2, $\lim_{x \to 2} 5x^2 = 20$. ■
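The proof of Example 13.2 only asks for some $\delta$ smaller than both 1 and $\varepsilon/25$. The following sketch, which is ours and not the author's, picks half of that minimum (an arbitrary choice) and measures the largest sampled value of $|5x^2 - 20|$ for $0 < |x - 2| < \delta$.

```python
def delta_for_example_13_2(eps):
    """One delta smaller than both 1 and eps/25 (halving the minimum is our choice)."""
    return min(1.0, eps / 25) / 2

def worst_error(delta, samples=1000):
    """Largest sampled value of |5x^2 - 20| over points with 0 < |x - 2| < delta."""
    worst = 0.0
    for k in range(1, samples + 1):
        offset = delta * k / (samples + 1)  # strictly between 0 and delta
        for x in (2 - offset, 2 + offset):
            worst = max(worst, abs(5 * x * x - 20))
    return worst

for eps in (100.0, 1.0, 0.01):
    delta = delta_for_example_13_2(eps)
    print(eps, delta, worst_error(delta) < eps)
```

The cap at 1 matters for large $\varepsilon$: it is what justifies the bound $|x + 2| \le 5$ used in the proof.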




The examples above (and the exercises below) involve limits that you 
probably regard as obvious. Our point is to illustrate Definition 13.2, not to 
compute difficult limits. Difficult limits come later (mostly in more advanced 
courses, not in this book) where Definition 13.2 will be used to great effect. 


Exercises for Section 13.2 


1. Prove that $\lim_{x \to 5} (8x - 3) = 37$.
2. Prove that $\lim_{x \to -1} (4x + 6) = 2$.
3. Prove that $\lim_{x \to 0} (x + 2) = 2$.
4. Prove that $\lim_{x \to 8} (2x - 7) = 9$.

5. Prove that $\lim_{x \to 3} (x^2 - 2) = 7$.
6. Prove that $\lim_{x \to 1} (4x^2 + 1) = 5$.


13.3 Limits That Do Not Exist 


Given a function f and a number c, there are two ways that $\lim_{x \to c} f(x) = L$ can
be false. First, there may be a different number $M \neq L$ for which $\lim_{x \to c} f(x) = M$.
Second, it may be that Statement (13.4) is false for all $L \in \mathbb{R}$. In such a case
we say that $\lim_{x \to c} f(x)$ does not exist. Contradiction is one way to prove that
$\lim_{x \to c} f(x)$ does not exist. Assume $\lim_{x \to c} f(x) = L$ and produce a contradiction.

Example 13.3 Suppose $f(x) = \frac{x}{2} + \frac{|x - 2|}{x - 2} + 2$. Prove $\lim_{x \to 2} f(x)$ does not exist.

Proof. Notice that f(2) is not defined, as it involves division by zero. Also,
f(x) behaves differently depending on whether x is to the right or left of 2.

If $x > 2$, then $x - 2$ is positive, so $|x - 2| = x - 2$ and $\frac{|x - 2|}{x - 2} = 1$, so $f(x) = \frac{x}{2} + 3$.

If $x < 2$, then $x - 2$ is negative, so $|x - 2| = -(x - 2)$ and $\frac{|x - 2|}{x - 2} = -1$, so $f(x) = \frac{x}{2} + 1$.

Therefore f is a piecewise function

$$f(x) = \begin{cases} \frac{1}{2}x + 3 & \text{if } x > 2 \\ \frac{1}{2}x + 1 & \text{if } x < 2. \end{cases}$$

Suppose for the sake of contradiction that
$\lim_{x \to 2} f(x) = L$, where L is a real number. Let $\varepsilon = 1$.
By Definition 13.2, there is a real number $\delta > 0$
for which $0 < |x - 2| < \delta$ implies $|f(x) - L| < 1$.
Put $a = 2 - \frac{\delta}{2}$, so $0 < |a - 2| < \delta$. Hence $|f(a) - L| < 1$.
Put $b = 2 + \frac{\delta}{2}$, so $0 < |b - 2| < \delta$. Hence $|f(b) - L| < 1$.
Further, $f(a) < 2$ and $f(b) > 4$, so $2 < |f(b) - f(a)|$.
With this and the help of the inequality (13.1),
we get a contradiction $2 < 2$, as follows:

$$2 < |f(b) - f(a)| = |(f(b) - L) - (f(a) - L)| \le |f(b) - L| + |f(a) - L| < 1 + 1 = 2.$$ ■
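The key step of this proof is that the points $a = 2 - \delta/2$ and $b = 2 + \delta/2$ always give values of f more than 2 apart, however small $\delta$ is. The sketch below (our own illustration; the function `gap` is a hypothetical helper) computes these values for a few choices of $\delta$.

```python
def f(x):
    """f(x) = x/2 + |x - 2|/(x - 2) + 2, undefined at x = 2."""
    return x / 2 + abs(x - 2) / (x - 2) + 2

def gap(delta):
    """Return f(a), f(b) and f(b) - f(a) for a = 2 - delta/2, b = 2 + delta/2."""
    a = 2 - delta / 2
    b = 2 + delta / 2
    return f(a), f(b), f(b) - f(a)

for delta in (1.0, 0.1, 1e-6):
    fa, fb, diff = gap(delta)
    print(delta, fa < 2, fb > 4, diff > 2)
```

Since $f(b) - f(a) = 2 + \delta/2$, no single number L can be within 1 of both values, which is exactly the contradiction reached above.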




Our next limit is a classic example of a non-existent limit. It often 
appears in first-semester calculus texts, where it is treated informally. 


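Example 13.4 Prove that $\lim_{x \to 0} \sin\left(\frac{1}{x}\right)$ does not exist.

As x approaches 0, the number $\frac{1}{x}$ grows bigger, approaching infinity, so
$\sin\left(\frac{1}{x}\right)$ just bounces up and down, faster and faster the closer x gets to 0.

Intuitively, we would guess that the limit does not exist, because $\sin\left(\frac{1}{x}\right)$ does
not approach any single number as x approaches 0. Here is a proof.

Proof. Suppose for the sake of contradiction that $\lim_{x \to 0} \sin\left(\frac{1}{x}\right) = L$ for $L \in \mathbb{R}$.
Definition 13.2 guarantees a number $\delta > 0$ for which $0 < |x - 0| < \delta$ implies
$\left|\sin\left(\frac{1}{x}\right) - L\right| < \frac{1}{2}$. Select $n \in \mathbb{N}$ large enough so that $\frac{1}{n\pi} < \delta$. As $0 < \left|\frac{1}{n\pi} - 0\right| < \delta$,
we have $\left|\sin\left(\frac{1}{1/(n\pi)}\right) - L\right| < \frac{1}{2}$, and this yields $|\sin(n\pi) - L| = |0 - L| = |L| < \frac{1}{2}$.

Next, take $\ell \in \mathbb{N}$ large enough so that $\frac{1}{\frac{\pi}{2} + 2\ell\pi} < \delta$. Then $0 < \left|\frac{1}{\frac{\pi}{2} + 2\ell\pi} - 0\right| < \delta$, so
we have $\left|\sin\left(\frac{1}{1/(\frac{\pi}{2} + 2\ell\pi)}\right) - L\right| < \frac{1}{2}$, which simplifies to $\left|\sin\left(\frac{\pi}{2} + 2\ell\pi\right) - L\right| = |1 - L| < \frac{1}{2}$.

Above we showed $|L| < \frac{1}{2}$ and $|1 - L| < \frac{1}{2}$. Now apply the inequality (13.2)
to get the contradiction $1 < 1$, as $1 = |L + (1 - L)| \le |L| + |1 - L| < \frac{1}{2} + \frac{1}{2} = 1$. ■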
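The two families of points used in this proof can be produced explicitly. The sketch below is our own illustration: for a given $\delta$ it finds a point of the form $1/(n\pi)$ and a point of the form $1/(\pi/2 + 2\ell\pi)$ inside $(0, \delta)$, where the sine of the reciprocal is (up to rounding) 0 and 1 respectively. The name `witnesses` is a hypothetical helper.

```python
import math

def witnesses(delta):
    """Return x0, x1 in (0, delta) with sin(1/x0) near 0 and sin(1/x1) near 1."""
    n = math.floor(1 / (math.pi * delta)) + 1      # guarantees 1/(n*pi) < delta
    l = math.floor(1 / (2 * math.pi * delta)) + 1  # guarantees 1/(pi/2 + 2*l*pi) < delta
    x0 = 1 / (n * math.pi)
    x1 = 1 / (math.pi / 2 + 2 * l * math.pi)
    return x0, x1

for delta in (1.0, 0.01, 1e-6):
    x0, x1 = witnesses(delta)
    print(delta, x0, math.sin(1 / x0), x1, math.sin(1 / x1))
```

However small $\delta$ is chosen, both kinds of points appear, so the values of $\sin(1/x)$ near 0 cannot all lie within $\frac{1}{2}$ of one number L.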


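Example 13.5 Investigate $\lim_{x \to 0} x \sin\left(\frac{1}{x}\right)$.

This is like the previous example, except for the extra x. Because $\left|\sin\left(\frac{1}{x}\right)\right| \le 1$,
we expect $x \sin\left(\frac{1}{x}\right)$ to go to 0 as x goes to 0. Indeed, we prove $\lim_{x \to 0} x \sin\left(\frac{1}{x}\right) = 0$.

Proof. Given $\varepsilon > 0$, let $\delta = \varepsilon$. Suppose $0 < |x - 0| < \delta$. Then $|x| < \delta$,
which is the same as $|x| < \varepsilon$. We get $\left|x \sin\left(\frac{1}{x}\right) - 0\right| = \left|x \sin\left(\frac{1}{x}\right)\right| = |x| \cdot \left|\sin\left(\frac{1}{x}\right)\right| \le
|x| \cdot 1 < \varepsilon$. From this, Definition 13.2 gives $\lim_{x \to 0} x \sin\left(\frac{1}{x}\right) = 0$. ■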
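For contrast with Example 13.4, the choice $\delta = \varepsilon$ can be spot-checked numerically. This sketch is ours rather than the author's; the function names and the sampling scheme are assumptions, and the check is illustrative only.

```python
import math

def g(x):
    """g(x) = x * sin(1/x), undefined at x = 0."""
    return x * math.sin(1 / x)

def holds_near_zero(eps, samples=1000):
    """Check |g(x) - 0| < eps at sample points with 0 < |x| < delta, where delta = eps."""
    delta = eps
    for k in range(1, samples + 1):
        offset = delta * k / (samples + 1)  # strictly between 0 and delta
        for x in (-offset, offset):
            if not abs(g(x) - 0) < eps:
                return False
    return True

for eps in (1.0, 1e-3, 1e-9):
    print(eps, holds_near_zero(eps))
```

The factor x damps the oscillation of $\sin(1/x)$, which is why this limit exists while the previous one does not.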


One final point. We remarked after Definition 13.2 that for $\lim_{x \to c} f(x) = L$ to make
sense, there must be a $\delta > 0$ for which f(x) is defined for all $x \in (c - \delta, c) \cup (c, c + \delta)$.
Thus, for example, following Definition 13.2 to the letter, we have to say that
$\lim_{x \to 0} \sqrt{x}$ does not exist because $\sqrt{x}$ is not defined for all $x \in (-\delta, 0) \cup (0, \delta)$. Your
calculus text probably introduced a right-hand limit $\lim_{x \to 0^+} \sqrt{x} = 0$. Though
this notion is not programmed into our Definition 13.2, you may revisit such
embellishments in later courses.




Exercises for Section 13.3 
Prove that the following limits do not exist. 


1. lim log9 | 2. lim 3. lim 4 
; 1 ; 1 . 1 
4. lim cos (=) 5. lim xcot (2) 6. lim a 


13.4 Limit Laws 


When you studied Calculus I your text presented a number of limit laws, 
such as $\lim_{x \to c} f(x)g(x) = \left(\lim_{x \to c} f(x)\right) \cdot \left(\lim_{x \to c} g(x)\right)$. These laws allowed you to compute
complex limits by reducing them to simpler limits, until the answer was at 
hand. But your calculus text probably did not prove the laws. Rather, you 
were asked to accept them as intuitively plausible (and useful) facts. 

Using Definition 13.2, we now present proofs of some limit laws. This 
serves two purposes. First, it puts your knowledge of calculus on a firmer 
foundation. Second, it highlights various strategies and thought patterns 
that are useful in limit proofs, which come to bear in later courses and work. 

The inequalities (13.1), (13.2) and (13.3) from Section 13.1 play a crucial
role. For convenience we repeat them here. For any $x, y \in \mathbb{R}$,

$$|x - y| \le |x| + |y|, \qquad |x + y| \le |x| + |y|, \qquad \text{and} \qquad |x| - |y| \le |x + y|.$$

We will use these frequently, usually without comment.

Our first limit law concerns the constant function $f(x) = a$ where $a \in \mathbb{R}$.
Its graph is a horizontal line with y-intercept a. It should be obvious that
$\lim_{x \to c} f(x) = a$ for any real number c. Nonetheless, let’s prove this obvious fact.


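Theorem 13.2 (Constant function rule) If $a \in \mathbb{R}$, then $\lim_{x \to c} a = a$.

Proof. Suppose $a \in \mathbb{R}$. According to Definition 13.2, to prove $\lim_{x \to c} a = a$, we
must show that for any $\varepsilon > 0$, there is a $\delta > 0$ for which $0 < |x - c| < \delta$ implies
$|a - a| < \varepsilon$. This is almost too easy. Just let $\delta = 1$ (or any other positive number). Then
$|a - a| < \varepsilon$ is automatic, because $|a - a| = 0$. ■

The identity function $f : \mathbb{R} \to \mathbb{R}$ is $f(x) = x$. Next we prove $\lim_{x \to c} f(x) = c$.

Theorem 13.3 (Identity function rule) If $c \in \mathbb{R}$, then $\lim_{x \to c} x = c$.

Proof. Given $\varepsilon > 0$, let $\delta = \varepsilon$. Then $0 < |x - c| < \delta$ implies $|x - c| < \varepsilon$. By
Definition 13.2, this means $\lim_{x \to c} x = c$. ■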




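Theorem 13.4 (Constant multiple rule)
If $\lim_{x \to c} f(x)$ exists, and $a \in \mathbb{R}$, then $\lim_{x \to c} a f(x) = a \lim_{x \to c} f(x)$.

Proof. Suppose $\lim_{x \to c} f(x)$ exists. We must show $\lim_{x \to c} a f(x) = a \lim_{x \to c} f(x)$. If $a = 0$,
then this reduces to $\lim_{x \to c} 0 = 0$, which is true by Theorem 13.2. Thus, for the
remainder of the proof we can assume $a \neq 0$.

Suppose $\lim_{x \to c} f(x) = L$. We must prove $\lim_{x \to c} a f(x) = aL$. By Definition 13.2,
this means we must show that for any $\varepsilon > 0$, there is a $\delta > 0$ for which
$0 < |x - c| < \delta$ implies $|a f(x) - aL| < \varepsilon$. Let $\varepsilon > 0$. Because $\lim_{x \to c} f(x) = L$, there
exists a $\delta > 0$ for which $0 < |x - c| < \delta$ implies $|f(x) - L| < \frac{\varepsilon}{|a|}$. So if $0 < |x - c| < \delta$,
then $|a f(x) - aL| = |a(f(x) - L)| = |a| \cdot |f(x) - L| < |a| \cdot \frac{\varepsilon}{|a|} = \varepsilon$.

In summary, we’ve shown that for any $\varepsilon > 0$, there is a $\delta > 0$ for which
$0 < |x - c| < \delta$ implies $|a f(x) - aL| < \varepsilon$. By Definition 13.2, $\lim_{x \to c} a f(x) = aL$. ■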


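Theorem 13.5 (Sum rule)
If both $\lim_{x \to c} f(x)$ and $\lim_{x \to c} g(x)$ exist, then $\lim_{x \to c} (f(x) + g(x)) = \lim_{x \to c} f(x) + \lim_{x \to c} g(x)$.

Proof. Let $\lim_{x \to c} f(x) = L$ and $\lim_{x \to c} g(x) = M$. We must prove $\lim_{x \to c} (f(x) + g(x)) =
L + M$. To prove this, take $\varepsilon > 0$. We need to find a corresponding $\delta > 0$ for which
$0 < |x - c| < \delta$ implies $|(f(x) + g(x)) - (L + M)| < \varepsilon$. With this in mind, notice that

$$|(f(x) + g(x)) - (L + M)| = |(f(x) - L) + (g(x) - M)| \le |f(x) - L| + |g(x) - M|. \tag{A}$$

As $\lim_{x \to c} f(x) = L$, there is a $\delta' > 0$ such that $0 < |x - c| < \delta'$ implies $|f(x) - L| < \frac{\varepsilon}{2}$.
As $\lim_{x \to c} g(x) = M$, there is a $\delta'' > 0$ such that $0 < |x - c| < \delta''$ implies $|g(x) - M| < \frac{\varepsilon}{2}$.
Now put $\delta = \min\{\delta', \delta''\}$, meaning that $\delta$ equals the smaller of $\delta'$ and $\delta''$.
If $0 < |x - c| < \delta$, then (A) gives $|(f(x) + g(x)) - (L + M)| < \frac{\varepsilon}{2} + \frac{\varepsilon}{2} = \varepsilon$.

We’ve now shown that for any $\varepsilon > 0$, there is a $\delta > 0$ for which $0 < |x - c| < \delta$
implies $|(f(x) + g(x)) - (L + M)| < \varepsilon$. Thus $\lim_{x \to c} (f(x) + g(x)) = L + M$. ■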


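Theorem 13.6 (Difference rule)

If both $\lim_{x \to c} f(x)$ and $\lim_{x \to c} g(x)$ exist, then $\lim_{x \to c} (f(x) - g(x)) = \lim_{x \to c} f(x) - \lim_{x \to c} g(x)$.

Proof. Combining the sum rule with the constant multiple rule gives

$$\begin{aligned}
\lim_{x \to c} (f(x) - g(x)) &= \lim_{x \to c} (f(x) + (-1) \cdot g(x)) = \lim_{x \to c} f(x) + \lim_{x \to c} (-1) \cdot g(x) \\
&= \lim_{x \to c} f(x) + (-1) \lim_{x \to c} g(x) = \lim_{x \to c} f(x) - \lim_{x \to c} g(x).
\end{aligned}$$ ■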




Theorem 13.7 (Multiplication rule)
If both $\lim_{x \to c} f(x)$ and $\lim_{x \to c} g(x)$ exist, then $\lim_{x \to c} f(x)g(x) = \left(\lim_{x \to c} f(x)\right) \cdot \left(\lim_{x \to c} g(x)\right)$.

Proof. Let $\lim_{x \to c} f(x) = L$ and $\lim_{x \to c} g(x) = M$. We must prove $\lim_{x \to c} f(x)g(x) = LM$.
To prove this, take $\varepsilon > 0$. We need to find a corresponding $\delta > 0$ for which
$0 < |x - c| < \delta$ implies $|f(x)g(x) - LM| < \varepsilon$. With this in mind, notice that

$$\begin{aligned}
|f(x)g(x) - LM| &= |(f(x)g(x) - Lg(x)) + (Lg(x) - LM)| \\
&\le |f(x)g(x) - Lg(x)| + |Lg(x) - LM| \\
&= |(f(x) - L)g(x)| + |L(g(x) - M)| \\
&= |f(x) - L| \cdot |g(x)| + |L| \cdot |g(x) - M|. \qquad \text{(A)}
\end{aligned}$$

Because $\lim_{x \to c} f(x) = L$ and $\lim_{x \to c} g(x) = M$, we can make the expressions $|f(x) - L|$
and $|L| \cdot |g(x) - M|$ in (A) arbitrarily small by making $|x - c|$ sufficiently small.
But the term $|f(x) - L| \cdot |g(x)|$ is a problem. For all we know, $|g(x)|$ could grow
large as $|f(x) - L|$ shrinks. To deal with this, choose some $\delta' > 0$ small enough
so that $0 < |x - c| < \delta'$ implies $|g(x) - M| < 1$. Then as long as $0 < |x - c| < \delta'$,

$$|g(x)| = |(g(x) - M) + M| \le |g(x) - M| + |M| < 1 + |M|.$$

Replacing the factor of $|g(x)|$ in (A) with the larger quantity $1 + |M|$, we get

$$|f(x)g(x) - LM| \le |f(x) - L| \cdot (1 + |M|) + |L| \cdot |g(x) - M|, \tag{B}$$

which holds provided $0 < |x - c| < \delta'$.

Choose $\delta'' > 0$ such that $0 < |x - c| < \delta''$ implies $|f(x) - L| < \frac{\varepsilon}{2(1 + |M|)}$.
Also choose $\delta''' > 0$ such that $0 < |x - c| < \delta'''$ implies $|g(x) - M| < \frac{\varepsilon}{2(1 + |L|)}$.
Now put $\delta = \min\{\delta', \delta'', \delta'''\}$. If $0 < |x - c| < \delta$, then (B) becomes

$$|f(x)g(x) - LM| < \frac{\varepsilon}{2(1 + |M|)} \cdot (1 + |M|) + |L| \cdot \frac{\varepsilon}{2(1 + |L|)} < \frac{\varepsilon}{2} + \frac{\varepsilon}{2} = \varepsilon.$$

To summarize, we’ve shown that for any $\varepsilon > 0$, there is a $\delta > 0$ for which
$0 < |x - c| < \delta$ implies $|f(x)g(x) - LM| < \varepsilon$. Therefore $\lim_{x \to c} f(x)g(x) = LM$. ■
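The bookkeeping in this proof can be read as a recipe: given a way to produce a suitable $\delta$ for f and for g separately, it produces a $\delta$ for the product. The sketch below is our own rendering of that recipe and is not part of the original text. The names `delta_f` and `delta_g` are hypothetical functions that return, for each tolerance, a $\delta$ satisfying Definition 13.2 for f and g respectively.

```python
def delta_for_product(delta_f, delta_g, L, M, eps):
    """Combine delta-finders for f and g into a delta for f*g, following the proof."""
    d1 = delta_g(1.0)                        # delta':   forces |g(x) - M| < 1
    d2 = delta_f(eps / (2 * (1 + abs(M))))   # delta'':  forces |f(x) - L| < eps/(2(1+|M|))
    d3 = delta_g(eps / (2 * (1 + abs(L))))   # delta''': forces |g(x) - M| < eps/(2(1+|L|))
    return min(d1, d2, d3)

# Illustration at c = 2 with f(x) = 3x + 4 (so L = 10) and g(x) = x (so M = 2).
f = lambda x: 3 * x + 4
g = lambda x: x
delta_f = lambda e: e / 3   # the choice made in Example 13.1
delta_g = lambda e: e       # the choice made in Theorem 13.3

eps = 0.5
delta = delta_for_product(delta_f, delta_g, 10, 2, eps)
# Largest sampled value of |f(x)g(x) - LM| over points with 0 < |x - 2| < delta.
worst = max(abs(f(x) * g(x) - 20)
            for k in range(1, 1001)
            for x in (2 - delta * k / 1001, 2 + delta * k / 1001))
print(delta, worst, worst < eps)
```

The resulting $\delta$ is usually far from the largest one that works; the proof only needs some $\delta$, not the best one.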
Our final rule has a proof similar to that of the multiplication rule. We
just have to take a little extra care with the denominators.


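Theorem 13.8 (Division rule)
If both $\lim_{x \to c} f(x)$ and $\lim_{x \to c} g(x)$ exist, and $\lim_{x \to c} g(x) \neq 0$, then $\lim_{x \to c} \dfrac{f(x)}{g(x)} = \dfrac{\lim_{x \to c} f(x)}{\lim_{x \to c} g(x)}$.

Proof. Suppose $\lim_{x \to c} f(x) = L$ and $\lim_{x \to c} g(x) = M \neq 0$. We must prove $\lim_{x \to c} \frac{f(x)}{g(x)} = \frac{L}{M}$.
To prove this, take $\varepsilon > 0$. We need to find a corresponding $\delta > 0$ for which
$0 < |x - c| < \delta$ implies $\left|\frac{f(x)}{g(x)} - \frac{L}{M}\right| < \varepsilon$. With this in mind, notice that

$$\begin{aligned}
\left|\frac{f(x)}{g(x)} - \frac{L}{M}\right| &= \left|\frac{M f(x) - L g(x)}{M g(x)}\right| = \left|\frac{(M f(x) - LM) - (L g(x) - LM)}{M g(x)}\right| \\
&= \left|\frac{1}{g(x)}(f(x) - L) - \frac{L}{M g(x)}(g(x) - M)\right| \\
&\le \left|\frac{1}{g(x)}(f(x) - L)\right| + \left|\frac{L}{M g(x)}(g(x) - M)\right| \\
&= \frac{1}{|g(x)|} \cdot |f(x) - L| + \frac{1}{|g(x)|} \cdot \left|\frac{L}{M}\right| \cdot |g(x) - M|. \qquad \text{(A)}
\end{aligned}$$

Because $\lim_{x \to c} f(x) = L$ and $\lim_{x \to c} g(x) = M$, we can make the terms $|f(x) - L|$ and
$\left|\frac{L}{M}\right| \cdot |g(x) - M|$ in (A) arbitrarily small by making $|x - c|$ sufficiently small.
To deal with the factor $\frac{1}{|g(x)|}$, choose $\delta' > 0$ so that $0 < |x - c| < \delta'$ implies
$|g(x) - M| < \frac{|M|}{2}$. So if $0 < |x - c| < \delta'$, the inequality (13.3) assures us

$$|g(x)| = |M + (g(x) - M)| \ge |M| - |g(x) - M| > |M| - \frac{|M|}{2} = \frac{|M|}{2}.$$

That is, $|g(x)| > \frac{|M|}{2}$ and consequently $\frac{1}{|g(x)|} < \frac{2}{|M|}$. Replacing the occurrences
of $\frac{1}{|g(x)|}$ in (A) with the larger value $\frac{2}{|M|}$, we get

$$\left|\frac{f(x)}{g(x)} - \frac{L}{M}\right| \le \frac{2}{|M|} \cdot |f(x) - L| + \frac{2|L|}{M^2} \cdot |g(x) - M|, \tag{B}$$

which holds provided $0 < |x - c| < \delta'$. Two cases finish the proof.

Case 1. Suppose $L \neq 0$. Choose $\delta'' > 0$ so $0 < |x - c| < \delta''$ implies $|f(x) - L| < \frac{\varepsilon |M|}{4}$.
Also choose $\delta''' > 0$ so that $0 < |x - c| < \delta'''$ implies $|g(x) - M| < \frac{\varepsilon M^2}{4|L|}$. Put
$\delta = \min\{\delta', \delta'', \delta'''\}$. If $0 < |x - c| < \delta$, then (B) yields

$$\left|\frac{f(x)}{g(x)} - \frac{L}{M}\right| < \frac{2}{|M|} \cdot \frac{\varepsilon |M|}{4} + \frac{2|L|}{M^2} \cdot \frac{\varepsilon M^2}{4|L|} = \frac{\varepsilon}{2} + \frac{\varepsilon}{2} = \varepsilon.$$

Case 2. Suppose $L = 0$. Let $\delta'' > 0$ be such that $0 < |x - c| < \delta''$ implies
$|f(x) - L| < \frac{\varepsilon |M|}{2}$. Putting $\delta = \min\{\delta', \delta''\}$, the inequality (B) becomes

$$\left|\frac{f(x)}{g(x)} - \frac{L}{M}\right| < \frac{2}{|M|} \cdot \frac{\varepsilon |M|}{2} = \varepsilon.$$

In each case we have shown that for any $\varepsilon > 0$, there is a $\delta > 0$ for which
$0 < |x - c| < \delta$ implies $\left|\frac{f(x)}{g(x)} - \frac{L}{M}\right| < \varepsilon$, so the proof is finished. ■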




Though you may not have proved any limit laws in your calculus course, 
you used them extensively. A common situation involved lim f(x), where f(c) 
was undefined because of a zero denominator. You learned to overcome this 
by algebraically canceling the offending part of the denominator. 

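Example 13.6 Find $\lim_{x \to 1} \dfrac{\frac{1}{x} - 1}{1 - x}$.

Here x approaches 1, but simply plugging in $x = 1$ gives $\frac{\frac{1}{1} - 1}{1 - 1} = \frac{0}{0}$ (undefined).
So we apply whatever algebra is needed to cancel the denominator $1 - x$,
and follow this with limit laws:

$$\begin{aligned}
\lim_{x \to 1} \frac{\frac{1}{x} - 1}{1 - x} &= \lim_{x \to 1} \frac{\frac{1}{x} - 1}{1 - x} \cdot \frac{x}{x} && \text{(multiply quotient by } 1 = \tfrac{x}{x}\text{)} \\
&= \lim_{x \to 1} \frac{1 - x}{(1 - x)x} && \text{(distribute } x \text{ on top)} \\
&= \lim_{x \to 1} \frac{1}{x} && \text{(cancel the } (1 - x)\text{)} \\
&= \frac{\lim_{x \to 1} 1}{\lim_{x \to 1} x} = \frac{1}{1} = 1 && \text{(apply limit laws)}
\end{aligned}$$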


Exercises for Section 13.4 

1. Given two or more functions $f_1, f_2, \ldots, f_n$, suppose that $\lim_{x \to c} f_i(x)$ exists for each
$1 \le i \le n$. Prove that $\lim_{x \to c} (f_1(x) + f_2(x) + \cdots + f_n(x)) = \lim_{x \to c} f_1(x) + \lim_{x \to c} f_2(x) + \cdots + \lim_{x \to c} f_n(x)$.
Use induction on n, with Theorem 13.5 serving as the base case.

2. Given two or more functions $f_1, f_2, \ldots, f_n$, suppose that $\lim_{x \to c} f_i(x)$ exists for each
$1 \le i \le n$. Prove that $\lim_{x \to c} (f_1(x) f_2(x) \cdots f_n(x)) = \left(\lim_{x \to c} f_1(x)\right) \cdot \left(\lim_{x \to c} f_2(x)\right) \cdots \left(\lim_{x \to c} f_n(x)\right)$.
Use induction on n, with Theorem 13.7 serving as the base case.

3. Use the previous two exercises and the constant multiple rule (Theorem 13.4) to
prove that if f(x) is a polynomial, then $\lim_{x \to c} f(x) = f(c)$ for any $c \in \mathbb{R}$.

4. Use Exercise 3 with a limit law to prove that if $\frac{f(x)}{g(x)}$ is a rational function (a
polynomial divided by a polynomial), and $g(c) \neq 0$, then $\lim_{x \to c} \frac{f(x)}{g(x)} = \frac{f(c)}{g(c)}$.

5. Use Definition 13.2 to prove that limits are unique in the sense that if $\lim_{x \to c} f(x) = L$
and $\lim_{x \to c} f(x) = M$, then $L = M$.

6. Prove the squeeze theorem: Suppose $g(x) \le f(x) \le h(x)$ for all $x \in \mathbb{R}$ satisfying
$0 < |x - c| < \delta$ for some $\delta > 0$. If $\lim_{x \to c} g(x) = L = \lim_{x \to c} h(x)$, then $\lim_{x \to c} f(x) = L$.




13.5 Continuity and Derivatives 


A major purpose of limits is that they can give information about how a 
function behaves near a “bad point” x = c. Even if f(c) is not defined, it may 
be that $\lim_{x \to c} f(x) = L$, for some number L. In this event we know that f(x)
becomes ever closer to L as x approaches the forbidden c. 

Of course not every value x =c is a “bad point.” It could be that f(c) is 
defined, and, moreover, $\lim_{x \to c} f(x) = f(c)$. If this is the case for every c in the
domain of f(x), then we say that f is continuous. Issues concerning whether 
or not f is continuous are called issues of continuity. 

In a first course in calculus it is easy to overlook the huge importance 
of continuity. And happily, we can (in a first course) almost ignore it. But 
in fact, the theoretical foundation of calculus rests on continuity. Roughly 
speaking, there are countless theorems having the form 


If | f is continuous, | then | f has some significant property. 


Continuity allows us to draw certain important conclusions about a function. 
Here is its definition. 


Definition 13.3 A function f(x) is continuous at x = c if $\lim_{x \to c} f(x) = f(c)$.
Note that this means all of the following three conditions must be met:
1. f(c) is defined,
2. $\lim_{x \to c} f(x)$ exists,
3. $\lim_{x \to c} f(x) = f(c)$.
If one or more of these conditions fail, then f(x) is discontinuous at c.

To illustrate this definition, five functions f(x) are graphed below. Only
the function on the far right is continuous at x = c.

[Figure: five graphs near x = c. The first four are discontinuous at x = c (conditions 1,3 fail; 3 fails; 2,3 fail; 1,2,3 fail). The fifth, on the far right, is continuous at x = c (1,2,3 all hold).]




Most familiar functions are continuous at each point x = c in their domain. 
For instance, exercises 3 and 4 in the previous section imply that polynomials 
and rational functions are continuous at any number c in their domains. 

One application of continuity is a limit law for composition. The previous 
section might prompt us to conjecture that $\lim_{x \to c} f(g(x)) = f\left(\lim_{x \to c} g(x)\right)$. However,
this does not hold without an assumption of continuity.

Theorem 13.9 (Composition rule)
If $\lim_{x \to c} g(x) = L$ and f is continuous at $x = L$, then $\lim_{x \to c} f(g(x)) = f\left(\lim_{x \to c} g(x)\right)$.

Proof. Suppose $\lim_{x \to c} g(x) = L$ and f is continuous at $x = L$. We need to show
$\lim_{x \to c} f(g(x)) = f(L)$. According to Definition 13.2, for any $\varepsilon > 0$ we must find a
corresponding $\delta > 0$ for which $0 < |x - c| < \delta$ implies $|f(g(x)) - f(L)| < \varepsilon$.

So let $\varepsilon > 0$. As f is continuous at L, Definition 13.3 yields $\lim_{x \to L} f(x) = f(L)$.
From this, we know there is a real number $\delta' > 0$ for which

$$|x - L| < \delta' \text{ implies } |f(x) - f(L)| < \varepsilon. \tag{A}$$

But also, from $\lim_{x \to c} g(x) = L$, we know that there is a real number $\delta > 0$ for
which $0 < |x - c| < \delta$ implies $|g(x) - L| < \delta'$.

If $0 < |x - c| < \delta$, then we have $|g(x) - L| < \delta'$, and from this (A) yields
$|f(g(x)) - f(L)| < \varepsilon$. Thus $\lim_{x \to c} f(g(x)) = f(L)$, and the proof is complete. ■


In calculus you learned that the derivative of a real-valued function f
is another function $f'$ for which $f'(c)$ is defined as

$$f'(c) = \lim_{x \to c} \frac{f(x) - f(c)}{x - c},$$

provided the limit exists (in which case we say f is differentiable at c).
You may recall that differentiability implies continuity.

Theorem 13.10 If f is differentiable at c, then f is continuous at c.

Proof. Suppose f is differentiable at c, so $\lim_{x \to c} \frac{f(x) - f(c)}{x - c} = f'(c)$. Write f(x) as

$$f(x) = \frac{f(x) - f(c)}{x - c} \cdot (x - c) + f(c).$$

Taking limits of both sides and using limit laws,

$$\lim_{x \to c} f(x) = \left(\lim_{x \to c} \frac{f(x) - f(c)}{x - c}\right) \cdot \left(\lim_{x \to c} (x - c)\right) + \lim_{x \to c} f(c) = f'(c) \cdot 0 + f(c) = f(c).$$

Thus $\lim_{x \to c} f(x) = f(c)$, which means f is continuous at c. ■




Exercises for Section 13.5 
1. Prove that the function $f(x) = \sqrt{x}$ is continuous at any number $c > 0$. Deduce that
$\lim_{x \to c} \sqrt{g(x)} = \sqrt{\lim_{x \to c} g(x)}$, provided $\lim_{x \to c} g(x)$ exists and is greater than zero.
2. Show that the condition of continuity in Theorem 13.9 is necessary by finding
functions f and g for which $\lim_{x \to c} g(x) = L$, and f is not continuous at $x = L$, and
$\lim_{x \to c} f(g(x)) \neq f\left(\lim_{x \to c} g(x)\right)$.


13.6 Limits at Infinity 


For some functions f(x), limits such as $\lim_{x \to \infty} f(x)$ and $\lim_{x \to -\infty} f(x)$ make sense.
Consider the function graphed below. As x moves to the right (towards
positive infinity) the corresponding f(x) value approaches 2. We express
this in symbols as $\lim_{x \to \infty} f(x) = 2$. Such a limit is called a limit at infinity,
which is a bit of a misnomer because x is never “at” infinity, just moving
toward it.

[Figure: graph of a function f(x) with dashed horizontal asymptotes y = 2 (approached as x moves right) and y = −1 (approached as x moves left).]

The graph squeezes in on the dashed horizontal line y = 2 as x moves to $\infty$.
This line is called a horizontal asymptote of the function f(x). It is not a
part of the graph, but it helps us visualize the behavior of f(x) as x grows.

Also, in this picture, as x moves to the left (toward negative infinity),
the corresponding value f(x) approaches −1. We express this in symbols as
$\lim_{x \to -\infty} f(x) = -1$. The horizontal line y = −1 is a second horizontal asymptote
of this function f(x).

In general, $\lim_{x \to \infty} f(x) = L$ means that f(x) is arbitrarily close to L, provided
that x is sufficiently large (i.e., “provided that x is sufficiently close to $\infty$”).
In other words, given any $\varepsilon > 0$, there is a number $N > 0$ (possibly quite
large) such that $x > N$ implies $|f(x) - L| < \varepsilon$. This is illustrated below.




[Figure: graph of y = f(x) illustrating the definition of a limit at infinity.]


Analogously, for x approaching $-\infty$, we say $\lim_{x \to -\infty} f(x) = L$ means that f(x)
is arbitrarily close to L, provided x is sufficiently close to $-\infty$. In other
words, given any $\varepsilon > 0$, there is a number $N < 0$ such that $x < N$ implies
$|f(x) - L| < \varepsilon$. Here is a summary of these ideas.

Definition 13.4 (Limits at Infinity)
1. The statement $\lim_{x \to \infty} f(x) = L$ means that for any real $\varepsilon > 0$, there is a
number $N > 0$ for which $x > N$ implies $|f(x) - L| < \varepsilon$.
2. The statement $\lim_{x \to -\infty} f(x) = L$ means that for any real $\varepsilon > 0$, there is a
number $N < 0$ for which $x < N$ implies $|f(x) - L| < \varepsilon$.


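Example 13.7 Investigate $\lim_{x \to \infty} \frac{\sin(x)}{x}$.

For any $x \in \mathbb{R}$, we know that $-1 \le \sin(x) \le 1$. Consequently we would expect
$\frac{\sin(x)}{x}$ to be very small when x is large, that is, we expect $\lim_{x \to \infty} \frac{\sin(x)}{x} = 0$.

Let us use Definition 13.4 to prove this. Given $\varepsilon > 0$, put $N = \frac{1}{\varepsilon}$. If $x > N$,
then $x > \frac{1}{\varepsilon}$, so $\frac{1}{x} < \varepsilon$, and hence $-\varepsilon < \frac{1}{x} \sin(x) < \varepsilon$, meaning $\left|\frac{\sin(x)}{x}\right| < \varepsilon$.
In summary, given $\varepsilon > 0$, there is an $N > 0$ for which $x > N$ implies
$\left|\frac{\sin(x)}{x} - 0\right| < \varepsilon$. By Definition 13.4, $\lim_{x \to \infty} \frac{\sin(x)}{x} = 0$.

[Figure: graph of $y = \frac{\sin(x)}{x}$.]

In a similar manner we can prove $\lim_{x \to -\infty} \frac{\sin(x)}{x} = 0$. Thus the x-axis y = 0 is
a horizontal asymptote to $y = \frac{\sin(x)}{x}$, as illustrated above.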
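The choice $N = 1/\varepsilon$ from Example 13.7 can be spot-checked in the same spirit as the earlier examples. The sketch below is our own illustration; the helper names and the particular sample points beyond N are assumptions.

```python
import math

def N_for_example_13_7(eps):
    """The N chosen in Example 13.7: N = 1 / eps."""
    return 1 / eps

def holds_beyond_N(eps, samples=1000, step=0.1):
    """Check |sin(x)/x - 0| < eps at sample points x > N."""
    N = N_for_example_13_7(eps)
    for k in range(1, samples + 1):
        x = N + step * k  # every sample point is strictly greater than N
        if not abs(math.sin(x) / x - 0) < eps:
            return False
    return True

for eps in (1.0, 0.1, 1e-4):
    print(eps, N_for_example_13_7(eps), holds_beyond_N(eps))
```

As with the $\varepsilon$ and $\delta$ checks, a smaller $\varepsilon$ forces a larger N, here in exact inverse proportion.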




Of course, not every limit at infinity will exist. Consider $\lim_{x \to \infty} x^2$. As x
goes to infinity, the quantity $x^2$ approaches infinity too. Common sense says
the limit does not exist because $x^2$ eventually exceeds any finite number L.
But it’s good practice to prove this common-sensical statement.

Suppose for the sake of contradiction that $\lim_{x \to \infty} x^2 = L$ for some $L \in \mathbb{R}$. Let
$\varepsilon = 1$, and apply Definition 13.4 to get a number N for which $x > N$ implies
$|x^2 - L| < 1$. The inequality (13.3) yields $|x^2| - |L| = |x^2| - |-L| \le |x^2 + (-L)| =
|x^2 - L| < 1$. In other words, $x^2 - |L| < 1$, or $x^2 < 1 + |L|$ for all $x > N$. But this
is false for those x that are bigger than both N and $1 + |L|$, a contradiction.

Even though $\lim_{x \to \infty} x^2$ does not exist, we allow the notation $\lim_{x \to \infty} x^2 = \infty$
to indicate that $x^2$ grows without bound as x goes to infinity. In general,
$\lim_{x \to \infty} f(x) = \infty$ means that f(x) eventually exceeds any number L:
1. $\lim_{x \to \infty} f(x) = \infty$ means that for any real number L, there is a positive N
for which $x > N$ implies $f(x) > L$.
2. $\lim_{x \to \infty} f(x) = -\infty$ means that for any real number L, there is a positive N
for which $x > N$ implies $f(x) < L$.

Limits of the form $\lim_{x \to \infty} f(x) = \pm\infty$ play a small role in the next section.


Exercises for Section 13.6 


Use Definition 13.4 to prove the following results. (Where appropriate, you may 
wish to adapt the corresponding proofs from Section 13.4.) 


1. $\lim_{x\to\infty} \frac{1}{x^n} = 0$ if $n \in \mathbb{N}$.
2. lim 5t*t = 5.
3. If $a \in \mathbb{R}$, then $\lim_{x\to\infty} a = a$.
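4. If $\lim_{x\to\infty} f(x)$ exists, and $a \in \mathbb{R}$, then $\lim_{x\to\infty} a f(x) = a \lim_{x\to\infty} f(x)$.
5. If both $\lim_{x\to\infty} f(x)$ and $\lim_{x\to\infty} g(x)$ exist, then $\lim_{x\to\infty} (f(x) + g(x)) = \lim_{x\to\infty} f(x) + \lim_{x\to\infty} g(x)$.
6. If both $\lim_{x\to\infty} f(x)$ and $\lim_{x\to\infty} g(x)$ exist, then $\lim_{x\to\infty} f(x)g(x) = \left(\lim_{x\to\infty} f(x)\right) \cdot \left(\lim_{x\to\infty} g(x)\right)$.
7. If both $\lim_{x\to\infty} f(x)$ and $\lim_{x\to\infty} g(x)$ exist, then $\lim_{x\to\infty} (f(x) - g(x)) = \lim_{x\to\infty} f(x) - \lim_{x\to\infty} g(x)$.
8. If both $\lim_{x\to\infty} f(x)$ and $\lim_{x\to\infty} g(x)$ exist, and $\lim_{x\to\infty} g(x) \ne 0$, then $\lim_{x\to\infty} \frac{f(x)}{g(x)} = \frac{\lim_{x\to\infty} f(x)}{\lim_{x\to\infty} g(x)}$.
9. If $\lim_{x\to\infty} g(x) = L$ and f is continuous at $x = L$, then $\lim_{x\to\infty} f(g(x)) = f\left(\lim_{x\to\infty} g(x)\right)$.
10. Prove that $\lim_{x\to\infty} \sin(x)$ does not exist.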




13.7 Sequences 


Our final two sections treat sequences and series, topics usually covered in 
a second semester of calculus. 
Recall that a sequence is an infinitely long list of real numbers

$$a_1,\ a_2,\ a_3,\ a_4,\ a_5,\ \ldots$$

The number $a_1$ is called the first term, $a_2$ is the second term, $a_3$ is the third
term, and so on. For example, the sequence

$$2,\ \tfrac{3}{4},\ \tfrac{4}{9},\ \tfrac{5}{16},\ \tfrac{6}{25},\ \tfrac{7}{36},\ \ldots$$

has nth term $a_n = \frac{n+1}{n^2}$. The nth term is sometimes called the general term.

We can define a sequence by giving a formula for its general term. The
sequence with general term $a_n = \frac{(-1)^{n+1}(n+1)}{n}$ is

$$2,\ -\tfrac{3}{2},\ \tfrac{4}{3},\ -\tfrac{5}{4},\ \tfrac{6}{5},\ -\tfrac{7}{6},\ \ldots$$

We denote a sequence with nth term $a_n$ as $\{a_n\}$. For example, the three
sequences displayed above are denoted compactly as $\{a_n\}$ and $\left\{\frac{n+1}{n^2}\right\}$ and
$\left\{\frac{(-1)^{n+1}(n+1)}{n}\right\}$, respectively. In this manner, the sequence $\{n^2 + 1\}$ is
$2, 5, 10, 17, 26, 37, \ldots$


Sometimes we define a sequence by writing down its first several terms, 
with the agreement that the general term is implied by the number pattern. 
For instance, the sequence 


$$1,\ 4,\ 9,\ 16,\ 25,\ 36,\ \ldots$$

is understood to be $\{n^2\}$ because $n^2$ is the most obvious formula that matches
the first six terms. But be alert to the fact that a finite number of terms can
never completely and unambiguously specify an infinite sequence. For all
we know, the nth term of $1, 4, 9, 16, 25, \ldots$ might not be $a_n = n^2$, but actually
$a_n = n^2 + (n-1)(n-2)(n-3)(n-4)(n-5)$. This agrees with the first five listed
terms, but the sixth term is $a_6 = 156$, not the expected $a_6 = 36$.
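The arithmetic behind this warning is easy to check by machine. The following is a small illustrative sketch (the function names `square` and `alternative` are ours, not the book's) comparing the two candidate formulas for the general term.

```python
def square(n):
    """The 'obvious' general term n^2."""
    return n ** 2


def alternative(n):
    """The rival general term n^2 + (n-1)(n-2)(n-3)(n-4)(n-5).

    The product vanishes whenever n is 1, 2, 3, 4 or 5, so the two
    formulas agree on the first five terms.
    """
    return n ** 2 + (n - 1) * (n - 2) * (n - 3) * (n - 4) * (n - 5)


# Compare the two formulas on the first five terms, then on the sixth.
first_five_agree = all(square(n) == alternative(n) for n in range(1, 6))
sixth_terms = (square(6), alternative(6))

print(first_five_agree)  # True
print(sixth_terms)       # (36, 156)
```

The same trick produces a rival formula matching any finite number of listed terms, which is why a list of terms alone never pins down a sequence.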

A sequence $\{a_n\}$ can be regarded as a function $f : \mathbb{N} \to \mathbb{R}$, where $f(n) = a_n$.
For example, the sequence $\left\{1 - \frac{1}{n}\right\}$ is the function $f(n) = 1 - \frac{1}{n}$. In this sense
we can graph a sequence; but the graph looks like a string of beads rather
than a curve, because the domain is $\mathbb{N}$, not $\mathbb{R}$. Here is the graph of $\left\{1 - \frac{1}{n}\right\}$.

[Figure: graph of the sequence $\left\{1 - \frac{1}{n}\right\}$, plotted as points above n = 1, 2, 3, ..., 34.]




Roughly speaking, we say a sequence $\{a_n\}$ converges to a number L if
the numbers $a_n$ get closer and closer to L as n gets bigger and bigger.

For example, the sequence $\left\{1 - \frac{1}{n}\right\}$ from the previous page converges to
L = 1, because as n gets big, the number $1 - \frac{1}{n}$ approaches 1.

In general, proving facts about convergence requires a precise definition.
For this, we can adapt the definition of a limit at infinity from Section 13.6.
The sequence $\{a_n\}$ converges to L if $a_n$ can be made arbitrarily close to L by
choosing n sufficiently large. Here is the exact definition.


Definition 13.5 A sequence $\{a_n\}$ converges to a number $L \in \mathbb{R}$ provided
that for any $\varepsilon > 0$ there is an $N \in \mathbb{N}$ for which $n > N$ implies $|a_n - L| < \varepsilon$.

If $\{a_n\}$ converges to L, we denote this state of affairs as $\lim_{n\to\infty} a_n = L$.

If $\{a_n\}$ does not converge to any number L, then we say it diverges.


Definition 13.5 is illustrated below. For any $\varepsilon > 0$ (no matter how small),
there is an integer N for which the terms $a_n$ of the sequence lie between
$L - \varepsilon$ and $L + \varepsilon$ provided $n > N$. Smaller values of $\varepsilon$ require larger values of N.
But no matter how small $\varepsilon$ is, there is a (possibly quite large) number N for
which $a_n$ is within $\varepsilon$ units from L when $n > N$.

[Figure: graph of a convergent sequence against n, with the terms $a_n$ lying between $L - \varepsilon$ and $L + \varepsilon$ for all $n > N$.]


For our first example, let’s return to the sequence $\left\{1 - \frac{1}{n}\right\}$, which is
graphed on the previous page (page 261). Notice that as n gets large, $\frac{1}{n}$
approaches 0, and $1 - \frac{1}{n}$ approaches 1. So we can see that the sequence
converges to 1. But let’s prove this, in order to illustrate Definition 13.5.

Example 13.8 Prove that the sequence $\left\{1 - \frac{1}{n}\right\}$ converges to 1.

Proof. Suppose $\varepsilon > 0$. Choose an integer $N > \frac{1}{\varepsilon}$, so that $\frac{1}{N} < \varepsilon$. Then if $n > N$
we have $|a_n - 1| = \left|\left(1 - \frac{1}{n}\right) - 1\right| = \frac{1}{n} < \frac{1}{N} < \varepsilon$. By Definition 13.5 the sequence
$\left\{1 - \frac{1}{n}\right\}$ converges to 1. ∎
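The choice of N in this proof can be tried out numerically. The sketch below is only an illustration, not part of the proof: it picks one admissible N for a given ε (any integer larger than 1/ε would do) and spot-checks finitely many terms beyond it. The names `a`, `choose_N` and `terms_within` are our own, and exact rational arithmetic is used to avoid rounding issues.

```python
import math
from fractions import Fraction


def a(n):
    """The nth term 1 - 1/n, as an exact fraction."""
    return 1 - Fraction(1, n)


def choose_N(eps):
    """Return one integer N with N > 1/eps, as the proof requires."""
    return math.floor(1 / eps) + 1


def terms_within(eps, how_many=1000):
    """Spot-check |a_n - 1| < eps for the first `how_many` indices n > N.

    A finite check like this illustrates the definition; only the
    proof covers every n > N.
    """
    N = choose_N(eps)
    return all(abs(a(n) - 1) < eps for n in range(N + 1, N + 1 + how_many))


print(choose_N(Fraction(1, 10)))        # 11
print(terms_within(Fraction(1, 10)))    # True
print(terms_within(Fraction(1, 1000)))  # True
```

Smaller values of ε force larger values of N, exactly as the discussion of Definition 13.5 describes.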




Example 13.9 Investigate the sequence $\left\{\frac{(-1)^{n+1}(n+1)}{n}\right\}$.

The first few terms of this sequence are $2, -\frac{3}{2}, \frac{4}{3}, -\frac{5}{4}, \frac{6}{5}, -\frac{7}{6}, \ldots$ The terms
alternate between positive and negative, with the odd terms positive and
the even terms negative. Here is a graph of the sequence.

[Figure: graph of the sequence $\left\{\frac{(-1)^{n+1}(n+1)}{n}\right\}$, plotted as points above n = 1, 2, 3, ..., 34.]

The picture suggests that as n increases, the terms bounce back and forth
between values that are alternately close to 1 and −1. This is also evident by
inspection of the general term $a_n = \frac{(-1)^{n+1}(n+1)}{n}$, because $\frac{n+1}{n}$ approaches 1 as
n grows toward infinity, while the power of −1 alternates the sign. Because
the general term does not approach any single number, it appears that this
sequence diverges. Now let’s set out to prove this. Our proof formalizes the
idea that if the sequence did converge to a number L, then L would have to
be within $\varepsilon$ units of both −1 and 1, and this is impossible if $\varepsilon \le 1$.


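Proof. Suppose for the sake of contradiction that the sequence $\left\{\frac{(-1)^{n+1}(n+1)}{n}\right\}$
converges to a real number L. Let $\varepsilon = 1$. By Definition 13.5 there is an $N \in \mathbb{N}$
for which $n > N$ implies $\left|\frac{(-1)^{n+1}(n+1)}{n} - L\right| < 1$.

If n is odd, then the nth term of the sequence is $a_n = \frac{(-1)^{n+1}(n+1)}{n} = \frac{n+1}{n} > 1$.

For n even, the nth term of the sequence is $a_n = \frac{(-1)^{n+1}(n+1)}{n} = -\frac{n+1}{n} < -1$.

Take an odd number $m > N$ and an even number $n > N$. The above three
lines yield

$$\begin{aligned}
2 = 1 - (-1) &< a_m - a_n && \text{(because } 1 < a_m \text{ and } 1 < -a_n\text{)}\\
&= |a_m - a_n| && \text{(} a_m - a_n \text{ is positive)}\\
&= |(a_m - L) - (a_n - L)| && \text{(add } 0 = L - L \text{ to } a_m - a_n\text{)}\\
&\le |a_m - L| + |a_n - L| && \text{(using } |x - y| \le |x| + |y|\text{)}\\
&< 1 + 1 = 2 && \text{(because } |a_n - L| < 1 \text{ when } n > N\text{)}
\end{aligned}$$

Thus 2 < 2, which is a contradiction. Consequently the sequence diverges. ∎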
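The two inequalities that drive this proof (odd terms exceed 1, even terms lie below −1) can be observed directly. The following sketch, with helper name `a` chosen by us, computes terms exactly and confirms that an odd-indexed term and an even-indexed term are always more than 2 apart on the range tested. It is an illustration of the idea, not a substitute for the argument.

```python
from fractions import Fraction


def a(n):
    """The nth term (-1)^(n+1) (n+1) / n, as an exact fraction."""
    return Fraction((-1) ** (n + 1) * (n + 1), n)


odd_indices = range(1, 200, 2)
even_indices = range(2, 200, 2)

# Odd-indexed terms are greater than 1; even-indexed terms are less than -1.
odd_terms_exceed_one = all(a(m) > 1 for m in odd_indices)
even_terms_below_minus_one = all(a(n) < -1 for n in even_indices)

# Hence any odd-indexed term and any even-indexed term differ by more than 2,
# so no single L can be within 1 unit of both.
gaps_exceed_two = all(a(m) - a(n) > 2 for m in odd_indices for n in even_indices)

print([str(a(n)) for n in range(1, 7)])  # ['2', '-3/2', '4/3', '-5/4', '6/5', '-7/6']
print(odd_terms_exceed_one, even_terms_below_minus_one, gaps_exceed_two)
```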




For another example of a sequence that diverges, consider $1, 4, 9, 16, 25, \ldots$
whose nth term is $a_n = n^2$. Clearly this diverges, because $\lim_{n\to\infty} n^2 = \infty$, which
is not a number. In such a case we say that the sequence diverges to $\infty$.

Definition 13.6 (Divergence to infinity)
1. We say a sequence $\{a_n\}$ diverges to $\infty$ if $\lim_{n\to\infty} a_n = \infty$. This means
that for any $L > 0$, there is a positive N for which $n > N$ implies $a_n > L$.
2. We say a sequence $\{a_n\}$ diverges to $-\infty$ if $\lim_{n\to\infty} a_n = -\infty$. This means
that for any $L < 0$, there is a positive N for which $n > N$ implies $a_n < L$.

This definition spells out a condition called divergence to $\infty$. But we
haven’t yet proved that a sequence meeting this condition actually diverges
in the sense of Definition 13.5. Exercise 7 below asks you to do this.


Exercises for Section 13.7 


1. Prove that 2 converges to 0. 2. Prove that { 5+ 3,\ converges to 5. 
3. Prove that 2h a diverges to $\infty$. 4. Prove that $\left\{1 - \frac{1}{2^n}\right\}$ converges to 1.
5. Prove that {gnttt converges to 2. 6. Prove that { fae ga | converges to 3. 


7. Prove that if a sequence diverges to infinity, then it diverges. 
8. Prove that the constant sequence $c, c, c, c, \ldots$ converges to c, for any $c \in \mathbb{R}$.

9. Prove that if $\{a_n\}$ converges to L, and $c \in \mathbb{R}$, then the sequence $\{c a_n\}$ converges
to cL.

10. Prove that if $\{a_n\}$ converges to L and $\{b_n\}$ converges to M, then the sequence
$\{a_n + b_n\}$ converges to L + M.

11. Prove that if $\{a_n\}$ converges to L and $\{b_n\}$ converges to M, then the sequence
$\{a_n b_n\}$ converges to LM.

12. Prove that if $\{a_n\}$ converges to L and $\{b_n\}$ converges to $M \ne 0$, then the sequence
$\left\{\frac{a_n}{b_n}\right\}$ converges to $\frac{L}{M}$. (You may assume $b_n \ne 0$ for each $n \in \mathbb{N}$.)

13. For any sequence $\{a_n\}$, there is a corresponding sequence $\{|a_n|\}$. Prove that if
$\{|a_n|\}$ converges to 0, then $\{a_n\}$ converges to 0. Give an example of a sequence
$\{a_n\}$ for which $\{|a_n|\}$ converges to a number $L \ne 0$, but $\{a_n\}$ diverges.

14. Suppose that $\{a_n\}$, $\{b_n\}$, and $\{c_n\}$ are sequences for which $a_n \le b_n \le c_n$ for all
sufficiently large n. (That is, $a_n \le b_n \le c_n$ for all $n > M$ for some integer M.) Prove
that if $\{a_n\}$ and $\{c_n\}$ converge to L, then $\{b_n\}$ also converges to L.




13.8 Series 


You may recall from your calculus course that there is a big difference 
between a sequence and a series. 

A sequence is an infinite list $a_1, a_2, a_3, a_4, a_5, a_6, \ldots$

But a series is an infinite sum $a_1 + a_2 + a_3 + a_4 + a_5 + a_6 + \cdots$.

We use the notation $\{a_n\}$ to denote the sequence $a_1, a_2, a_3, a_4, \ldots$, but we
use sigma notation to denote a series:

$$\sum_{k=1}^{\infty} a_k = a_1 + a_2 + a_3 + a_4 + a_5 + a_6 + \cdots$$

$$\sum_{k=1}^{\infty} \frac{1}{2^k} = \frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} + \frac{1}{64} + \cdots$$


[...] equals $\infty$ because every fraction in the infinite sum is greater than 1.

Series are significant in calculus because many complicated functions 
can be expressed as series involving terms built from simple algebraic 
operations. For example, your calculus course may have developed the 
Maclaurin series for various functions, such as 


$$\cos(x) = \sum_{k=0}^{\infty} \frac{(-1)^k x^{2k}}{(2k)!} = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \frac{x^8}{8!} - \frac{x^{10}}{10!} + \cdots$$


But before we make any progress with series, it is essential that we 
clearly specify what it means to add up infinitely many numbers. We need 
to understand the situations in which this does and does not make sense. 

The key to codifying whether or not a series

$$\sum_{k=1}^{\infty} a_k = a_1 + a_2 + a_3 + a_4 + a_5 + a_6 + a_7 + a_8 + a_9 + \cdots$$

adds up to a finite number is to terminate it at an arbitrary nth term:

$$\sum_{k=1}^{n} a_k = a_1 + a_2 + a_3 + a_4 + a_5 + a_6 + \cdots + a_n.$$

This sum is called the nth partial sum of the series, and is denoted as $s_n$.




The series has a partial sum $s_n$ for each positive integer n:

$$\begin{aligned}
s_1 &= a_1\\
s_2 &= a_1 + a_2\\
s_3 &= a_1 + a_2 + a_3\\
s_4 &= a_1 + a_2 + a_3 + a_4\\
s_5 &= a_1 + a_2 + a_3 + a_4 + a_5\\
&\ \ \vdots\\
s_n &= a_1 + a_2 + a_3 + a_4 + a_5 + \cdots + a_n = \sum_{k=1}^{n} a_k
\end{aligned}$$

If indeed the infinite sum $S = \sum_{k=1}^{\infty} a_k$ makes sense, then we expect that the
partial sum $s_n = \sum_{k=1}^{n} a_k$ is a very good approximation to S when n is large.
Moreover, the larger n gets, the closer $s_n$ should be to S. In other words,
the sequence $s_1, s_2, s_3, s_4, s_5, \ldots$ of partial sums should converge to S. This
leads to our main definition. We say that an infinite series converges if its
sequence of partial sums converges.


Definition 13.7 A series $\sum_{k=1}^{\infty} a_k$ converges to a real number S if its
sequence of partial sums $\{s_n\}$ converges to S. In this case we say $\sum_{k=1}^{\infty} a_k = S$.

We say $\sum_{k=1}^{\infty} a_k$ diverges if the sequence $\{s_n\}$ diverges. In this case $\sum_{k=1}^{\infty} a_k$
does not make sense as a sum or does not sum to a finite number.


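Example 13.10 Prove that $\sum_{k=1}^{\infty} \frac{1}{2^k} = 1$.

Proof. Consider the partial sum $s_n = \frac{1}{2^1} + \frac{1}{2^2} + \frac{1}{2^3} + \cdots + \frac{1}{2^n}$. We can get a neat
formula for $s_n$ by noting $s_n = 2s_n - s_n$. Then simplify and cancel like terms:

$$\begin{aligned}
s_n = 2s_n - s_n &= 2\left(\frac{1}{2^1} + \frac{1}{2^2} + \frac{1}{2^3} + \cdots + \frac{1}{2^{n-1}} + \frac{1}{2^n}\right) - \left(\frac{1}{2^1} + \frac{1}{2^2} + \frac{1}{2^3} + \cdots + \frac{1}{2^{n-1}} + \frac{1}{2^n}\right)\\
&= \left(\frac{2}{2^1} + \frac{2}{2^2} + \frac{2}{2^3} + \cdots + \frac{2}{2^{n-1}} + \frac{2}{2^n}\right) - \left(\frac{1}{2^1} + \frac{1}{2^2} + \frac{1}{2^3} + \cdots + \frac{1}{2^{n-1}} + \frac{1}{2^n}\right)\\
&= \left(1 + \frac{1}{2^1} + \frac{1}{2^2} + \cdots + \frac{1}{2^{n-2}} + \frac{1}{2^{n-1}}\right) - \left(\frac{1}{2^1} + \frac{1}{2^2} + \frac{1}{2^3} + \cdots + \frac{1}{2^{n-1}} + \frac{1}{2^n}\right) = 1 - \frac{1}{2^n}.
\end{aligned}$$

Thus $s_n = 1 - \frac{1}{2^n}$, so the sequence of partial sums is $\{s_n\} = \left\{1 - \frac{1}{2^n}\right\}$, which
converges to 1 by Exercise 13.7.4. Definition 13.7 yields $\sum_{k=1}^{\infty} \frac{1}{2^k} = 1$. ∎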
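The closed form for the partial sums, $s_n = 1 - \frac{1}{2^n}$, can be confirmed with exact arithmetic for as many values of n as one cares to try. The sketch below (the name `partial_sum` is ours) does this for n up to 29; it checks the formula on those cases and leaves the general statement to the proof.

```python
from fractions import Fraction


def partial_sum(n):
    """The nth partial sum 1/2 + 1/4 + ... + 1/2^n, computed exactly."""
    return sum(Fraction(1, 2 ** k) for k in range(1, n + 1))


# The closed form from the proof: s_n = 1 - 1/2^n.
formula_matches = all(
    partial_sum(n) == 1 - Fraction(1, 2 ** n) for n in range(1, 30)
)

print([str(partial_sum(n)) for n in range(1, 6)])  # ['1/2', '3/4', '7/8', '15/16', '31/32']
print(formula_matches)                             # True
```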




Despite the previous example, in practice definitions 13.7 and 13.5 are 
rarely used to prove that a particular sequence or series converges to a 
particular number. Instead we tend to use a multitude of convergence tests 
that are covered in a typical calculus course. Examples of such tests include 
the comparison test, the ratio test, the root test and the alternating series test. 
You learned how to use these tests and techniques in your calculus course, 
though that course may not have actually proved that the tests were valid. 
The point of our present discussion is that definitions 13.7 and 13.5 can be 
used to prove the tests. To underscore this point, this section’s exercises 
ask you to prove several convergence tests. 

By way of illustration, we close with a proof of a theorem that leads to a 
test for divergence. 


Theorem 13.11 If $\sum_{k=1}^{\infty} a_k$ converges, then the sequence $\{a_n\}$ converges to 0.

Proof. We use direct proof. Suppose $\sum_{k=1}^{\infty} a_k$ converges, and say $\sum_{k=1}^{\infty} a_k = S$.
Then by Definition 13.7, the sequence of partial sums $\{s_n\}$ converges to S.
From this, Definition 13.5 says that for any $\varepsilon > 0$ there is an $N \in \mathbb{N}$ for which
$n > N$ implies $|s_n - S| < \varepsilon$. Thus also $n - 1 > N$ implies $|s_{n-1} - S| < \varepsilon$.

We need to show that $\{a_n\}$ converges to 0. So take $\varepsilon > 0$. By the previous
paragraph, there is an $N' \in \mathbb{N}$ for which $n > N'$ implies $|s_n - S| < \frac{\varepsilon}{2}$ and
$|s_{n-1} - S| < \frac{\varepsilon}{2}$. Notice that $a_n = s_n - s_{n-1}$ for any $n > 2$. So if $n > N'$ we have

$$|a_n - 0| = |s_n - s_{n-1}| = |(s_n - S) - (s_{n-1} - S)| \le |s_n - S| + |s_{n-1} - S| < \frac{\varepsilon}{2} + \frac{\varepsilon}{2} = \varepsilon.$$

Therefore, by Definition 13.5, the sequence $\{a_n\}$ converges to 0. ∎


The contrapositive of this theorem is a convenient test for divergence: 
Corollary 13.1 (Divergence test) If $\{a_n\}$ diverges, or if it converges to a
non-zero number, then $\sum_{k=1}^{\infty} a_k$ diverges.

For example, according to the divergence test, the series $\sum_{k=1}^{\infty} \left(1 - \frac{1}{k}\right)$
diverges, because the sequence $\left\{1 - \frac{1}{k}\right\}$ converges to 1. Also, $\sum_{k=1}^{\infty} \frac{(-1)^{k+1}(k+1)}{k}$
diverges because $\left\{\frac{(-1)^{k+1}(k+1)}{k}\right\}$ diverges. (See Example 13.9 on page 263.)


The divergence test gives only a criterion for deciding if a series diverges.
It says nothing about convergence. If $\{a_n\}$ converges to 0, then $\sum_{k=1}^{\infty} a_k$ may
or may not converge, depending on the particular series. Certainly if $\sum_{k=1}^{\infty} a_k$
converges, then $\{a_n\}$ converges to 0, by Theorem 13.11. But $\{a_n\}$ converging
to 0 does not necessarily mean that $\sum_{k=1}^{\infty} a_k$ converges. A significant example
of this is the so-called harmonic series:

$$\sum_{k=1}^{\infty} \frac{1}{k} = 1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \frac{1}{5} + \cdots$$

According to Exercise 21 in Chapter 10, if we go out as far as $2^n$ terms, then
the $2^n$th partial sum satisfies

$$s_{2^n} = \frac{1}{1} + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \cdots + \frac{1}{2^n - 1} + \frac{1}{2^n} \ge 1 + \frac{n}{2}.$$

Because $1 + \frac{n}{2}$ grows arbitrarily large as n increases, the sequence of partial
sums diverges to $\infty$. Consequently the harmonic series diverges.
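The lower bound on the harmonic partial sums is slow to take effect, which makes it a good candidate for a quick computation. The following sketch (function names ours) checks the inequality $s_{2^n} \ge 1 + \frac{n}{2}$ exactly for n from 0 to 10, assuming the bound is the one quoted above from Exercise 21 of Chapter 10.

```python
from fractions import Fraction


def harmonic_partial_sum(m):
    """The mth partial sum 1 + 1/2 + ... + 1/m of the harmonic series."""
    return sum(Fraction(1, k) for k in range(1, m + 1))


def bound_holds(n):
    """Check the bound s_(2^n) >= 1 + n/2 for one value of n."""
    return harmonic_partial_sum(2 ** n) >= 1 + Fraction(n, 2)


all_bounds_hold = all(bound_holds(n) for n in range(0, 11))
print(all_bounds_hold)

# Each term 1/k tends to 0, yet the partial sums keep growing (slowly).
for n in (1, 5, 10):
    print(n, float(harmonic_partial_sum(2 ** n)), 1 + n / 2)
```

The terms shrink to 0 while the partial sums pass every bound, so convergence of $\{a_n\}$ to 0 is necessary but not sufficient for convergence of the series.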


Exercises for Section 13.8 


Use Definition 13.7 (and Definition 13.5, as needed) to prove the following results. 
Solutions for these exercises are not included in the back of the book, for they can 


be found in most good calculus texts. In the exercises we abbreviate $\sum_{k=1}^{\infty} a_k$ as $\sum a_k$.

1. A geometric series is one having the form $a + ar + ar^2 + ar^3 + \cdots$, where $a, r \in \mathbb{R}$. (The
first term in the sum is a, and beyond that, the kth term is r times the previous
term.) Prove that if $|r| < 1$, then the series converges to $\frac{a}{1-r}$. Also, if $a \ne 0$ and
$|r| \ge 1$, then the series diverges. (If you need guidance, you may draw inspiration
from Example 13.10, which concerns a geometric series with $a = r = \frac{1}{2}$.)

2. Prove the comparison test: Suppose $\sum a_k$ and $\sum b_k$ are series. If $0 \le a_k \le b_k$ for
each k, and $\sum b_k$ converges, then $\sum a_k$ converges. Also, if $0 \le b_k \le a_k$ for each k,
and $\sum b_k$ diverges, then $\sum a_k$ diverges.

3. Prove the limit comparison test: Suppose $\sum a_k$ and $\sum b_k$ are series for which
$a_k, b_k > 0$ for each k. If $\lim_{k\to\infty} \frac{a_k}{b_k} = 0$ and $\sum b_k$ converges, then $\sum a_k$ converges. (Your
proof may use any of the above exercises.)

4. Prove the absolute convergence test: Let $\sum a_k$ be a series. If $\sum |a_k|$ converges, then
$\sum a_k$ converges. (Your proof may use any of the above exercises.)

5. Prove the ratio test: Given a series $\sum a_k$ with each $a_k$ positive, if $\lim_{k\to\infty} \frac{a_{k+1}}{a_k} = L < 1$,
then $\sum a_k$ converges. Also, if $L > 1$, then $\sum a_k$ diverges. (Your proof may use any
of the above exercises.)




CHAPTER 14 


Cardinality of Sets 


This chapter is all about cardinality of sets. At first this looks like a very
simple concept. To find the cardinality of a set, just count its elements.
If $A = \{a, b, c, d\}$, then $|A| = 4$; if $B = \{n \in \mathbb{Z} : -5 \le n \le 5\}$, then $|B| = 11$. In this
case $|A| < |B|$. What could be simpler than that?

Actually, the idea of cardinality becomes quite subtle when the sets
are infinite. The main point of this chapter is to explain how there are
numerous different kinds of infinity, and some infinities are bigger than
others. Two sets A and B can both have infinite cardinality, yet $|A| < |B|$.


14.1 Sets with Equal Cardinalities 


We begin with a discussion of what it means for two sets to have the same 
cardinality. Up until this point we’ve said |A| =|B| if A and B have the same 
number of elements: Count the elements of A, then count the elements of 
B. If you get the same number, then |A| = |B|. 

Although this is a fine strategy if the sets are finite (and not too big!), 
it doesn’t apply to infinite sets because we’d never be done counting their 
elements. We need a new approach that applies to both finite and infinite 
sets. Here it is: 


Definition 14.1 Two sets A and B have the same cardinality, written
$|A| = |B|$, if there exists a bijective function $f : A \to B$. If no such bijective
f exists, then the sets have unequal cardinalities, written $|A| \ne |B|$.

[Figure: two sets A and B with a bijection $f : A \to B$ pairing their elements.]

The above picture illustrates our definition. There is a bijective function
$f : A \to B$, so $|A| = |B|$. The function f matches up A with B. Think of f as
describing how to overlay A onto B so that they fit together perfectly.




On the other hand, if A and B are as indicated in either of the following
figures, then there can be no bijection $f : A \to B$. (The best we can do is a
function that is either injective or surjective, but not both.) Therefore the
definition says $|A| \ne |B|$ in these cases.

Example 14.1 The sets $A = \{n \in \mathbb{Z} : 0 \le n \le 5\}$ and $B = \{n \in \mathbb{Z} : -5 \le n \le 0\}$
have the same cardinality because there is a bijective function $f : A \to B$
given by the rule $f(n) = -n$.
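For finite sets such as these, bijectivity can be checked exhaustively. The sketch below assumes the inequalities defining A and B are non-strict (so A = {0, 1, ..., 5} and B = {−5, ..., 0}); the variable names are ours. The rule f(n) = −n is injective if distinct inputs give distinct outputs, and surjective if the outputs fill all of B.

```python
# Assumed reading of the example: A = {0,...,5} and B = {-5,...,0}.
A = set(range(0, 6))
B = set(range(-5, 1))


def f(n):
    """The rule f(n) = -n from the example."""
    return -n


image = {f(n) for n in A}

# Injective: no two elements of A share an output, so the image has |A| elements.
is_injective = len(image) == len(A)
# Surjective: every element of B is hit.
is_surjective = image == B
is_bijection = is_injective and is_surjective

print(sorted(image))  # [-5, -4, -3, -2, -1, 0]
print(is_bijection)   # True
```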


Several comments are in order. First, if $|A| = |B|$, there can be lots of
bijective functions from A to B. We only need to find one of them in order to
conclude $|A| = |B|$. Second, as bijective functions play such a big role here,
we use the word bijection to mean bijective function. Thus the function
$f(n) = -n$ from Example 14.1 is a bijection. Also, an injective function is
called an injection and a surjective function is called a surjection.

We emphasize and reiterate that Definition 14.1 applies to finite as well
as infinite sets. If A and B are infinite, then $|A| = |B|$ provided there exists
a bijection $f : A \to B$. If no such bijection exists, then $|A| \ne |B|$.


Example 14.2 This example shows that $|\mathbb{N}| = |\mathbb{Z}|$. To see why this is true,
notice that the following table describes a bijection $f : \mathbb{N} \to \mathbb{Z}$.

n    | 1   2   3   4   5   6   7   8   9   10  11  12  13  14  15  ...
f(n) | 0   1  -1   2  -2   3  -3   4  -4   5   -5  6   -6  7   -7  ...

Notice that f is described in such a way that it is both injective and surjective.
Every integer appears exactly once on the infinitely long second row. Thus,
according to the table, given any $b \in \mathbb{Z}$ there is some natural number n
with $f(n) = b$, so f is surjective. It is injective because the way the table
is constructed forces $f(m) \ne f(n)$ whenever $m \ne n$. Because of this bijection
$f : \mathbb{N} \to \mathbb{Z}$, we must conclude from Definition 14.1 that $|\mathbb{N}| = |\mathbb{Z}|$.
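One way to make such a table concrete is to write down a rule that generates it together with a rule that undoes it. The sketch below assumes the table follows the pattern 0, 1, −1, 2, −2, 3, −3, ... (each integer appearing once, alternating in sign); the functions `f` and `g` are our own illustration, and other formulas describe the same pairing. Having a two-sided inverse is what makes f a bijection.

```python
def f(n):
    """Send the natural numbers 1, 2, 3, 4, 5, ... to 0, 1, -1, 2, -2, ...

    Assumed pattern: even n maps to n/2, odd n maps to -(n-1)/2.
    """
    return n // 2 if n % 2 == 0 else -(n // 2)


def g(z):
    """A candidate inverse: positive z came from 2z, other z from 1 - 2z."""
    return 2 * z if z > 0 else 1 - 2 * z


print([f(n) for n in range(1, 16)])
# [0, 1, -1, 2, -2, 3, -3, 4, -4, 5, -5, 6, -6, 7, -7]

# g undoes f and f undoes g on the ranges tested, as an inverse must.
round_trip_naturals = all(g(f(n)) == n for n in range(1, 1000))
round_trip_integers = all(f(g(z)) == z for z in range(-500, 501))
print(round_trip_naturals, round_trip_integers)  # True True
```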


Example 14.2 may seem slightly unsettling. On one hand it makes sense
that $|\mathbb{N}| = |\mathbb{Z}|$ because $\mathbb{N}$ and $\mathbb{Z}$ are both infinite, so their cardinalities are
both “infinity.” On the other hand, $\mathbb{Z}$ may seem twice as large as $\mathbb{N}$ because $\mathbb{Z}$
has all the negative integers as well as the positive ones. Definition 14.1
settles the issue. Because the bijection $f : \mathbb{N} \to \mathbb{Z}$ matches up $\mathbb{N}$ with $\mathbb{Z}$, it
follows that $|\mathbb{N}| = |\mathbb{Z}|$. We summarize this with a theorem.

Theorem 14.1 There exists a bijection $f : \mathbb{N} \to \mathbb{Z}$. Therefore $|\mathbb{N}| = |\mathbb{Z}|$.


The fact that N and Z have the same cardinality might prompt us to 
compare the cardinalities of other infinite sets. How, for example, do N and 
R compare? Let’s turn our attention to this. 

In fact, $|\mathbb{N}| \ne |\mathbb{R}|$. This was first recognized by Georg Cantor (1845-1918),
who devised an ingenious argument to show that there are no surjective
functions $f : \mathbb{N} \to \mathbb{R}$. (This in turn implies that there can be no bijections
$f : \mathbb{N} \to \mathbb{R}$, so $|\mathbb{N}| \ne |\mathbb{R}|$ by Definition 14.1.)

We now describe Cantor’s argument for why there are no surjections
$f : \mathbb{N} \to \mathbb{R}$. We will reason informally, rather than writing out an exact proof.
Take any arbitrary function $f : \mathbb{N} \to \mathbb{R}$. Here’s why f can’t be surjective:


Imagine making a table for f, where values of n in N are in the left-hand 
column and the corresponding values f(n) are on the right. The first few 
entries might look something as follows. In this table, the real numbers f(n) 
are written with all their decimal places trailing off to the right. Thus, even 
though f(1) happens to be the real number 0.4, we write it as 0.40000000...., 
etc. 


 n    f(n)
 1    0.40000000000000...
 2    .90060708666900...
 3    .590500940044101...
 4    .590704008048050...
 5    .90026000000506...
 6    .82809582050020...
 7    .890505550655808...
 8    .72080640000448...
 9    .55000088880077...
10    .90020722078051...
11    .90000880000900...
12    .590280008009671...
13    .89008024008050...
14    .590008742080226...




There is a diagonal shaded band in the table. For each $n \in \mathbb{N}$, this band
covers the nth decimal place of f(n):

The 1st decimal place of f(1) is the 1st entry on the diagonal.
The 2nd decimal place of f(2) is the 2nd entry on the diagonal.
The 3rd decimal place of f(3) is the 3rd entry on the diagonal.
The 4th decimal place of f(4) is the 4th entry on the diagonal, etc.

The diagonal helps us construct a number $b \in \mathbb{R}$ that is unequal to any f(n).
Just let the nth decimal place of b differ from the nth entry of the diagonal.
Then the nth decimal place of b differs from the nth decimal place of f(n).
In order to be definite, define b to be the positive number less than 1 whose
nth decimal place is 0 if the nth decimal place of f(n) does not equal 0, and
whose nth decimal place is 1 if the nth decimal place of f(n) equals 0. Thus,
for the function f illustrated in the above table, we have


b = 0.01010001001000... 


and b has been defined so that, for any $n \in \mathbb{N}$, its nth decimal place is unequal
to the nth decimal place of f(n). Therefore $f(n) \ne b$ for every natural number
n, meaning f is not surjective.

Since this argument applies to any function $f : \mathbb{N} \to \mathbb{R}$ (not just the one in
the above example) we conclude that there exist no bijections $f : \mathbb{N} \to \mathbb{R}$, so
$|\mathbb{N}| \ne |\mathbb{R}|$ by Definition 14.1. We summarize this as a theorem.
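The rule for building b from the diagonal is mechanical, so it can be expressed as a short program. The sketch below works on a finite list of digit strings standing in for the decimal parts of f(1), f(2), ...; the sample rows are made up for illustration and are not the table from the text. It applies the rule stated above: the nth digit of b is 1 if the nth digit of the nth row is 0, and 0 otherwise.

```python
def diagonal_number(decimal_rows):
    """Build the decimal digits of b from the diagonal of a table.

    `decimal_rows[n-1]` holds the digits after the decimal point of f(n),
    and must have at least n digits. The nth digit of b is chosen to
    differ from the nth digit of f(n).
    """
    digits = []
    for n, row in enumerate(decimal_rows, start=1):
        nth_digit = row[n - 1]
        digits.append("1" if nth_digit == "0" else "0")
    return "0." + "".join(digits)


# Hypothetical four-row table (decimal parts only), for illustration.
rows = ["4000", "5006", "5050", "5070"]
b = diagonal_number(rows)
print(b)  # 0.0101

# b disagrees with row n in its nth decimal place, for every row.
b_digits = b[2:]
differs_on_diagonal = all(b_digits[i] != rows[i][i] for i in range(len(rows)))
print(differs_on_diagonal)  # True
```

A program can only process finitely many rows, whereas the argument in the text applies the same rule to every n at once.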


Theorem 14.2 There exists no bijection $f : \mathbb{N} \to \mathbb{R}$. Therefore $|\mathbb{N}| \ne |\mathbb{R}|$.


This is our first indication of how there are different kinds of infinities.
Both $\mathbb{N}$ and $\mathbb{R}$ are infinite sets, yet $|\mathbb{N}| \ne |\mathbb{R}|$. We will continue to develop this
theme throughout this chapter. The next example shows that the intervals
$(0, \infty)$ and $(0, 1)$ on $\mathbb{R}$ have the same cardinality.

Figure 14.1. A bijection $f : (0, \infty) \to (0, 1)$. Imagine a light source at point P.
Then f(x) is the point on the y-axis whose shadow is x.




Example 14.3 Show that $|(0, \infty)| = |(0, 1)|$.

To accomplish this, we need to show that there is a bijection $f : (0, \infty) \to (0, 1)$.
We describe this function geometrically. Consider the interval $(0, \infty)$ as the
positive x-axis of $\mathbb{R}^2$. Let the interval $(0, 1)$ be on the y-axis as illustrated in
Figure 14.1, so that $(0, \infty)$ and $(0, 1)$ are perpendicular to each other.

The figure also shows a point $P = (-1, 1)$. Define f(x) to be the point on
$(0, 1)$ where the line from P to $x \in (0, \infty)$ intersects the y-axis. By similar
triangles, we have

$$\frac{1}{x+1} = \frac{f(x)}{x},$$

and therefore

$$f(x) = \frac{x}{x+1}.$$

If it is not clear from the figure that $f : (0, \infty) \to (0, 1)$ is bijective, then you
can verify it using the techniques from Section 12.2. (Exercise 16, below.)
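The geometric description and the algebraic formula can be compared directly. In the sketch below, `shadow_point` computes where the line from P = (−1, 1) to (x, 0) crosses the y-axis, `f` is the formula x/(x+1), and `f_inverse` is a candidate inverse y/(1−y) obtained by solving y = x/(x+1) for x. All three names are ours, and the checks use a handful of sample points with exact arithmetic rather than proving bijectivity.

```python
from fractions import Fraction


def shadow_point(x):
    """Height at which the line from P = (-1, 1) to (x, 0) meets the y-axis.

    The line has slope -1/(x + 1), so moving one unit right from P
    lowers the height from 1 to 1 - 1/(x + 1).
    """
    return 1 - 1 / (x + 1)


def f(x):
    """The formula f(x) = x / (x + 1) on (0, infinity)."""
    return x / (x + 1)


def f_inverse(y):
    """Candidate inverse on (0, 1), from solving y = x / (x + 1) for x."""
    return y / (1 - y)


samples = [Fraction(1, 100), Fraction(1, 3), Fraction(1), Fraction(7), Fraction(1000)]

geometry_matches_formula = all(shadow_point(x) == f(x) for x in samples)
values_in_unit_interval = all(0 < f(x) < 1 for x in samples)
inverse_round_trip = all(f_inverse(f(x)) == x for x in samples)

print(geometry_matches_formula, values_in_unit_interval, inverse_round_trip)
```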


It is important to note that equality of cardinalities is an equivalence
relation on sets: it is reflexive, symmetric and transitive. Let us confirm
this. Given a set A, the identity function $A \to A$ is a bijection, so $|A| = |A|$.
(This is the reflexive property.) For the symmetric property, if $|A| = |B|$, then
there is a bijection $f : A \to B$, and its inverse is a bijection $f^{-1} : B \to A$, so
$|B| = |A|$. For transitivity, suppose $|A| = |B|$ and $|B| = |C|$. Then there are
bijections $f : A \to B$ and $g : B \to C$. The composition $g \circ f : A \to C$ is a bijection
(Theorem 12.2), so $|A| = |C|$.

The transitive property can be useful. If, in trying to show two sets A 
and C have the same cardinality, we can produce a third set B for which 
|A| = |B| and |B| =|C|, then transitivity assures us that indeed |A| =|C|. The 
next example uses this idea. 


Example 14.4 Show that $|\mathbb{R}| = |(0, 1)|$.

Because of the bijection $g : \mathbb{R} \to (0, \infty)$ where $g(x) = 2^x$, we have $|\mathbb{R}| = |(0, \infty)|$.
Also, Example 14.3 shows that $|(0, \infty)| = |(0, 1)|$. Therefore $|\mathbb{R}| = |(0, 1)|$.


So far in this chapter we have declared that two sets have “the same
cardinality” if there is a bijection between them. They have “different
cardinalities” if there exists no bijection between them. Using this idea,
we showed that $|\mathbb{Z}| = |\mathbb{N}| \ne |\mathbb{R}| = |(0, \infty)| = |(0, 1)|$. So, we have a means of
determining when two sets have the same or different cardinalities. But
we have neatly avoided saying exactly what cardinality is. For example, we
can say that $|\mathbb{Z}| = |\mathbb{N}|$, but what exactly is $|\mathbb{Z}|$, or $|\mathbb{N}|$? What exactly are these
things that are equal? Certainly not numbers, for they are too big. And
saying they are “infinity” is not accurate, because we now know that there
are different types of infinity. So just what kind of mathematical entity is
$|\mathbb{Z}|$? In general, given a set X, exactly what is its cardinality $|X|$?

This is a lot like asking what a number is. A number, say 5, is an 
abstraction, not a physical thing. Early in life we instinctively grouped 
together certain sets of things (five apples, five oranges, etc.) and conceived 
of 5 as the thing common to all such sets. In a very real sense, the number 
5 is an abstraction of the fact that any two of these sets can be matched 
up via a bijection. That is, it can be identified with a certain equivalence 
class of sets under the “has the same cardinality as” relation. (Recall that 
this is an equivalence relation.) This is easy to grasp because our sense of 
numeric quantity is so innate. But in exactly the same way we can say that 
the cardinality of a set X is what is common to all sets that can be matched 
to X via a bijection. This may be harder to grasp, but it is really no different 
from the idea of the magnitude of a (finite) number. 

In fact, we could be concrete and define |X| to be the equivalence class of 
all sets whose cardinality is the same as that of X. This has the advantage 
of giving an explicit meaning to |X|. But there is no harm in taking the 
intuitive approach and just interpreting the cardinality |X| of a set X to 
be a measure of the “size” of X. The point of this section is that we have a 
means of deciding whether two sets have the same size or different sizes. 


Exercises for Section 14.1 


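A. Show that the two given sets have equal cardinality by describing a bijection
from one to the other. Describe your bijection with a formula (not as a table).

1. $\mathbb{R}$ and $(0, \infty)$
2. $\mathbb{R}$ and $(\sqrt{2}, \infty)$
3. $\mathbb{R}$ and $(0, 1)$
4. The set of even integers and the set of odd integers
5. $A = \{3k : k \in \mathbb{Z}\}$ and $B = \{7k : k \in \mathbb{Z}\}$
6. $\mathbb{N}$ and $S = \left\{\frac{\sqrt{2}}{n} : n \in \mathbb{N}\right\}$
7. $\mathbb{Z}$ and $S = \left\{\ldots, \frac{1}{8}, \frac{1}{4}, \frac{1}{2}, 1, 2, 4, 8, 16, \ldots\right\}$
8. $\mathbb{Z}$ and $S = \{x \in \mathbb{R} : \sin x = 1\}$
9. $\{0, 1\} \times \mathbb{N}$ and $\mathbb{N}$
10. $\{0, 1\} \times \mathbb{N}$ and $\mathbb{Z}$
11. $[0, 1]$ and $(0, 1)$
12. $\mathbb{N}$ and $\mathbb{Z}$ (Suggestion: use Exercise 18 of Section 12.2.)
13. $\mathscr{P}(\mathbb{N})$ and $\mathscr{P}(\mathbb{Z})$ (Suggestion: use Exercise 12, above.)
14. $\mathbb{N} \times \mathbb{N}$ and $\{(n, m) \in \mathbb{N} \times \mathbb{N} : n \le m\}$

B. Answer the following questions concerning bijections from this section.

15. Find a formula for the bijection f in Example 14.2 (page 270).
16. Verify that the function f in Example 14.3 (page 273) is a bijection.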




14.2 Countable and Uncountable Sets 


Let’s summarize the main points from the previous section. 
1. $|A| = |B|$ if and only if there exists a bijection $A \to B$.
2. $|\mathbb{N}| = |\mathbb{Z}|$ because there exists a bijection $\mathbb{N} \to \mathbb{Z}$.
3. $|\mathbb{N}| \ne |\mathbb{R}|$ because there exists no bijection $\mathbb{N} \to \mathbb{R}$.


Thus, even though N, Z and R are all infinite sets, their cardinalities 
are not all the same. The sets N and Z have the same cardinality, but R’s 
cardinality is different from that of both the other sets. This means infinite 
sets can have different sizes. We now make some definitions to put words 
and symbols to this phenomenon. 

In a certain sense you can count the elements of N; you can count its 
elements off as 1,2,3,4,..., but you’d have to continue this process forever 
to count the whole set. Thus we will call N a countably infinite set, and the 
same term is used for any set whose cardinality equals that of N. 


Definition 14.2 Suppose A is a set. Then A is countably infinite if
$|\mathbb{N}| = |A|$, that is, if there exists a bijection $\mathbb{N} \to A$. The set A is countable
if it is finite or countably infinite. The set A is uncountable if it is infinite
and $|\mathbb{N}| \ne |A|$, that is, if A is infinite and there is no bijection $\mathbb{N} \to A$.


Thus Z is countably infinite but R is uncountable. This section deals 
mainly with countably infinite sets. Uncountable sets are treated later. 

If A is countably infinite, then $|\mathbb{N}| = |A|$, so there is a bijection $f : \mathbb{N} \to A$.
Think of f as “counting” the elements of A. The first element of A is f(1),
followed by f(2), then f(3) and so on. It makes sense to think of a countably
infinite set as the smallest type of infinite set, because if the counting
process stopped, the set would be finite, not infinite; a countably infinite set
has the fewest elements that a set can have and still be infinite. We reserve
the special symbol $\aleph_0$ to stand for the cardinality of countably infinite sets.

Definition 14.3 The cardinality of the natural numbers is denoted
as $\aleph_0$. That is, $|\mathbb{N}| = \aleph_0$. Thus any countably infinite set has cardinality $\aleph_0$.

(The symbol $\aleph$ is the first letter in the Hebrew alphabet, and is pronounced
“aleph.” The symbol $\aleph_0$ is pronounced “aleph naught.”) The summary of
facts at the beginning of this section shows $|\mathbb{Z}| = \aleph_0$ and $|\mathbb{R}| \ne \aleph_0$.

Example 14.5 Let $E = \{2k : k \in \mathbb{Z}\}$ be the set of even integers. The function
$f : \mathbb{Z} \to E$ defined as $f(n) = 2n$ is easily seen to be a bijection, so we have
$|\mathbb{Z}| = |E|$. Thus, as $|\mathbb{N}| = |\mathbb{Z}| = |E|$, the set E is countably infinite and $|E| = \aleph_0$.




Here is a significant fact: The elements of any countably infinite set A
can be written in an infinitely long list a₁, a₂, a₃, a₄, ... that begins with some
element a₁ ∈ A and includes every element of A. For example, the set E in
the above example can be written in list form as 0, 2, −2, 4, −4, 6, −6, 8, −8, ...
The reason that this can be done is as follows. Since A is countably infinite,
Definition 14.2 says there is a bijection f : N → A. This allows us to list out
the set A as an infinite list f(1), f(2), f(3), f(4), ... Conversely, if the elements
of A can be written in list form as a₁, a₂, a₃, ..., then the function f : N → A
defined as f(n) = aₙ is a bijection, so A is countably infinite. We summarize
this as follows.


Theorem 14.3 A set A is countably infinite if and only if its elements can
be arranged in an infinite list a₁, a₂, a₃, a₄, ...


As an example of how this theorem might be used, let P denote the set 
of all prime numbers. Since we can list its elements as 2, 3, 5, 7, 11, 13, ..., it
follows that the set P is countably infinite. 
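To make the idea of "writing a set in list form" concrete, here is a minimal Python sketch for the set E of even integers, listed as 0, 2, −2, 4, −4, ... The names even_integers and nth_even are our own illustrative choices. The first function produces the list one entry at a time, and the second gives a formula for the nth entry, which plays the role of a bijection N → E.

```python
from itertools import islice

def even_integers():
    """Yield the even integers in the order 0, 2, -2, 4, -4, 6, -6, ..."""
    yield 0
    k = 1
    while True:
        yield 2 * k
        yield -2 * k
        k += 1

def nth_even(n):
    """Return the n-th entry (n = 1, 2, 3, ...) of the list above."""
    if n == 1:
        return 0
    # Even positions hold the positive entries, odd positions the negative ones.
    return n if n % 2 == 0 else -(n - 1)

first_nine = list(islice(even_integers(), 9))
```

Every even integer appears at exactly one position n, which is what Theorem 14.3 requires of a list.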

As another consequence of Theorem 14.3, note that we can interpret the 
fact that the set R is not countably infinite as meaning that it is impossible 
to write out all the elements of R in an infinite list. (After all, we tried to do 
that in the table on page 271, and failed!) 

This raises a question. Is it also impossible to write out all the elements 
of Q in an infinite list? In other words, is the set Q of rational numbers 
countably infinite or uncountable? If you start plotting the rational numbers 
on the number line, they seem to mostly fill up R. Sure, some numbers 
such as √2, π and e will not be plotted, but the dots representing rational
numbers seem to predominate. We might thus expect Q to be uncountable. 
However, it is a surprising fact that Q is countable. The proof presented 
below arranges all the rational numbers in an infinitely long list. 


Theorem 14.4 The set Q of rational numbers is countably infinite. 


Proof. To prove this, we just need to show how to write the set Q in list
form. Begin by arranging all rational numbers in an infinite array. This is
done by making the following chart. The top row has a list of all integers,
beginning with 0, then alternating signs as they increase. Each column
headed by an integer k contains all the fractions (in reduced form) with
numerator k. For example, the column headed by 2 contains the fractions
2/1, 2/3, 2/5, 2/7, and so on. It does not contain 2/2, 2/4, 2/6, etc., because those are
not reduced, and in fact their reduced forms appear in the column headed
by 1. You should examine this table and convince yourself that it contains
all rational numbers in Q.




  0     1     −1     2     −2     3     −3     4     −4     5     −5    ⋯

 0/1   1/1   −1/1   2/1   −2/1   3/1   −3/1   4/1   −4/1   5/1   −5/1   ⋯
       1/2   −1/2   2/3   −2/3   3/2   −3/2   4/3   −4/3   5/2   −5/2   ⋯
       1/3   −1/3   2/5   −2/5   3/4   −3/4   4/5   −4/5   5/3   −5/3   ⋯
       1/4   −1/4   2/7   −2/7   3/5   −3/5   4/7   −4/7   5/4   −5/4   ⋯
       1/5   −1/5   2/9   −2/9   3/7   −3/7   4/9   −4/9   5/6   −5/6   ⋯
       1/6   −1/6   2/11  −2/11  3/8   −3/8   4/11  −4/11  5/7   −5/7   ⋯
       1/7   −1/7   2/13  −2/13  3/10  −3/10  4/13  −4/13  5/8   −5/8   ⋯
        ⋮      ⋮      ⋮      ⋮      ⋮      ⋮      ⋮      ⋮      ⋮      ⋮


Next, draw an infinite path in this array, beginning at 0/1 and snaking
back and forth as indicated below. Every rational number is on this path.


[Figure: the array above, with an infinite path that begins at 0/1 and snakes back and forth through its entries.]




Beginning at 0/1 and following the path, we get an infinite list of all
rational numbers:

0, 1, 1/2, −1/2, −1, 2, 2/3, 2/5, −1/3, 1/3, 1/4, −1/4, 2/7, −2/7, −2/5, −2/3, −2, 3, 3/2, ...

By Theorem 14.3, it follows that Q is countably infinite, that is, |Q| = |N|.
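The listing of Q can also be carried out mechanically. The following Python sketch is one way to do it, under a stated assumption: instead of the snaking path of the proof, it visits the reduced fractions in order of increasing (numerator + denominator), which is a different route through the same array. The function name rationals is our own.

```python
from fractions import Fraction
from itertools import islice
from math import gcd

def rationals():
    """Yield every rational number exactly once.

    Assumption: entries are visited by increasing numerator + denominator,
    not along the snaking path used in the proof.
    """
    yield Fraction(0, 1)
    total = 2
    while True:
        for den in range(1, total):
            num = total - den
            if gcd(num, den) == 1:      # keep reduced fractions only
                yield Fraction(num, den)
                yield Fraction(-num, den)
        total += 1

first = list(islice(rationals(), 11))
```

Any route works, as long as each entry of the array is reached after finitely many steps and no entry is visited twice.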


It is also true that the Cartesian product of two countably infinite sets
is itself countably infinite, as our next theorem states.


Theorem 14.5 If A and B are both countably infinite, then so is A × B.


Proof. Suppose A and B are both countably infinite. By Theorem 14.3, we
know we can write A and B in list form as


A = {a₁, a₂, a₃, a₄, ...},
B = {b₁, b₂, b₃, b₄, ...}.


Figure 14.2 shows how to form an infinite path winding through all of A × B.
Therefore A × B can be written in list form, so it is countably infinite.


   ⋮         ⋮         ⋮         ⋮         ⋮         ⋮         ⋮
(a₁,b₇)   (a₂,b₇)   (a₃,b₇)   (a₄,b₇)   (a₅,b₇)   (a₆,b₇)   (a₇,b₇)   ⋯
(a₁,b₆)   (a₂,b₆)   (a₃,b₆)   (a₄,b₆)   (a₅,b₆)   (a₆,b₆)   (a₇,b₆)   ⋯
(a₁,b₅)   (a₂,b₅)   (a₃,b₅)   (a₄,b₅)   (a₅,b₅)   (a₆,b₅)   (a₇,b₅)   ⋯
(a₁,b₄)   (a₂,b₄)   (a₃,b₄)   (a₄,b₄)   (a₅,b₄)   (a₆,b₄)   (a₇,b₄)   ⋯
(a₁,b₃)   (a₂,b₃)   (a₃,b₃)   (a₄,b₃)   (a₅,b₃)   (a₆,b₃)   (a₇,b₃)   ⋯
(a₁,b₂)   (a₂,b₂)   (a₃,b₂)   (a₄,b₂)   (a₅,b₂)   (a₆,b₂)   (a₇,b₂)   ⋯
(a₁,b₁)   (a₂,b₁)   (a₃,b₁)   (a₄,b₁)   (a₅,b₁)   (a₆,b₁)   (a₇,b₁)   ⋯

[Figure: an infinite path winds through every entry of this array.]


Figure 14.2. A product of two countably infinite sets is countably infinite 
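As an illustration, the following Python sketch lists A × B when A and B are given by their lists n ↦ aₙ and n ↦ bₙ. It assumes one particular path, namely sweeping the diagonals i + j = 2, 3, 4, ... of the array; this is a convenient choice for the sketch and is not claimed to be the path drawn in Figure 14.2. The name product_list and the sample lists are hypothetical.

```python
from itertools import count, islice

def product_list(list_a, list_b):
    """List A x B, given functions n -> a_n and n -> b_n for n = 1, 2, 3, ...

    Assumption: pairs (a_i, b_j) are visited along the diagonals
    i + j = 2, 3, 4, ..., one concrete choice of path.
    """
    for total in count(2):
        for i in range(1, total):
            yield (list_a(i), list_b(total - i))

# Sample lists: a_n = n and b_n = 2n.
pairs = list(islice(product_list(lambda n: n, lambda n: 2 * n), 6))
```

Each diagonal is finite, so any given pair (aᵢ, bⱼ) is reached after finitely many steps.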




As an example of a consequence of this theorem, notice that since Q is 
countably infinite, the set Q x Q is also countably infinite. 

Recall that the word “corollary” means a result that follows easily from 
some other result. We have the following corollary of Theorem 14.5. 


Corollary 14.1 Given n countably infinite sets A₁, A₂, ..., Aₙ, with n ≥ 2,
the Cartesian product A₁ × A₂ × ⋯ × Aₙ is also countably infinite.


Proof. The proof is by induction on n. For the basis step, notice that when
n = 2 the statement asserts that for countably infinite sets A₁ and A₂, the
product A₁ × A₂ is countably infinite, and this is true by Theorem 14.5.

Assume that for some k ≥ 2, any product A₁ × A₂ × ⋯ × Aₖ of countably
infinite sets is countably infinite. Consider a product A₁ × A₂ × ⋯ × Aₖ × Aₖ₊₁
of k+1 countably infinite sets. It is easy to confirm that the function


f : A₁ × A₂ × A₃ × ⋯ × Aₖ × Aₖ₊₁ → (A₁ × A₂ × A₃ × ⋯ × Aₖ) × Aₖ₊₁

f(x₁, x₂, ..., xₖ, xₖ₊₁) = ((x₁, x₂, ..., xₖ), xₖ₊₁)


is bijective, so |A₁ × A₂ × A₃ × ⋯ × Aₖ × Aₖ₊₁| = |(A₁ × A₂ × A₃ × ⋯ × Aₖ) × Aₖ₊₁|.
By the induction hypothesis, (A₁ × A₂ × A₃ × ⋯ × Aₖ) × Aₖ₊₁ is a product of
two countably infinite sets, so it is countably infinite by Theorem 14.5. As
noted above, A₁ × A₂ × A₃ × ⋯ × Aₖ × Aₖ₊₁ has the same cardinality as the set
(A₁ × A₂ × A₃ × ⋯ × Aₖ) × Aₖ₊₁, so it too is countably infinite. ∎
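The regrouping map used in this proof sends a (k+1)-tuple to a pair whose first entry is a k-tuple. A short Python sketch, with the hypothetical names nest and unnest, shows the map together with its inverse; having an inverse is what makes it bijective.

```python
def nest(xs):
    """Send (x1, ..., xk, xk+1) to ((x1, ..., xk), xk+1)."""
    return (xs[:-1], xs[-1])

def unnest(pair):
    """Inverse map: send ((x1, ..., xk), xk+1) back to (x1, ..., xk, xk+1)."""
    head, last = pair
    return head + (last,)

example = nest((1, 2, 3, 4))
```

Here unnest(nest(t)) returns t for every tuple t, and nest(unnest(p)) returns p for every pair p of the form (tuple, element).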


Theorem 14.6 If A and B are both countably infinite, then their union
A ∪ B is countably infinite.


Proof. Suppose A and B are both countably infinite. By Theorem 14.3, we
know we can write A and B in list form as


A = {a₁, a₂, a₃, a₄, ...},
B = {b₁, b₂, b₃, b₄, ...}.


We can “shuffle” A and B into one infinite list for A ∪ B as follows.

A ∪ B = {a₁, b₁, a₂, b₂, a₃, b₃, a₄, b₄, ...}.


(We agree not to list an element twice if it belongs to both A and B.) Thus
A ∪ B is countably infinite by Theorem 14.3. ∎
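The shuffle in this proof is easy to imitate in code. The following Python sketch interleaves two infinite lists and skips any element that has already appeared. The names union_list and multiples are our own, and the multiples of 2 and of 3 serve only as sample sets that overlap.

```python
from itertools import islice

def union_list(list_a, list_b):
    """Interleave two infinite iterators, skipping anything already listed."""
    seen = set()
    for a, b in zip(list_a, list_b):
        for x in (a, b):
            if x not in seen:
                seen.add(x)
                yield x

def multiples(m):
    """Yield m, 2m, 3m, ..."""
    n = m
    while True:
        yield n
        n += m

merged = list(islice(union_list(multiples(2), multiples(3)), 8))
```

The number 6 belongs to both sample sets, yet it appears only once in the merged list.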




Exercises for Section 14.2 


1. Prove that the set A = {ln(n) : n ∈ N} ⊆ R is countably infinite.
2. Prove that the set A = {(m,n) ∈ N × N : m ≤ n} is countably infinite.
3. Prove that the set A = {(5n, −3n) : n ∈ Z} is countably infinite.
4. Prove that the set of all irrational numbers is uncountable. (Suggestion: Consider
proof by contradiction using Theorems 14.4 and 14.6.)
5. Prove or disprove: There exists a countably infinite subset of the set of irrational
numbers.
6. Prove or disprove: There exists a bijective function f : Q → R.
7. Prove or disprove: The set Q¹⁰⁰ is countably infinite.
8. Prove or disprove: The set Z × Q is countably infinite.
9. Prove or disprove: The set {0,1} × N is countably infinite.
10. Prove or disprove: The set A = {√2/n : n ∈ N} is countably infinite.
11. Describe a partition of N that divides N into eight countably infinite subsets.
12. Describe a partition of N that divides N into ℵ₀ countably infinite subsets.
13. Prove or disprove: If A = {X ⊆ N : X is finite}, then |A| = ℵ₀.
14. Suppose A = {(m,n) ∈ N × R : n = πm}. Is it true that |N| = |A|?
15. Theorem 14.5 implies that N × N is countably infinite. Construct an alternate
proof of this fact by showing that the function φ : N × N → N defined as φ(m,n) =
2ⁿ⁻¹(2m − 1) is bijective.


14.3 Comparing Cardinalities 


At this point we know that there are at least two different kinds of infinity. 
On one hand, there are countably infinite sets such as N, of cardinality ℵ₀.
Then there is the uncountable set R. Are there other kinds of infinity 
beyond these two kinds? The answer is “yes,” but to see why we first need 
to introduce some new definitions and theorems. 

Our first task will be to formulate a definition of what we mean by 
|A|<|B|. Of course if A and B are finite we know exactly what this means: 
|A| <|B| means that when the elements of A and B are counted, A is found 
to have fewer elements than B. But this process breaks down if A or B is 
infinite, for then the elements can’t be counted. 

The language of functions helps us overcome this difficulty. Notice that
for finite sets A and B it is intuitively clear that |A| < |B| if and only if
there exists an injective function f : A → B but there is no bijective function
f : A → B. The following diagram illustrates this:




We will use this idea to define what is meant by |A| < |B| and |A| ≤ |B|. For
emphasis, the following definition also restates what is meant by |A| = |B|.


Definition 14.4 Suppose A and B are sets.

1. |A| = |B| means there is a bijection A → B.

2. |A| < |B| means there is an injection A → B, but no bijection A → B.

3. |A| ≤ |B| means there is an injection A → B.


For example, consider N and R. The function f : N → R defined as f(n) = n
is clearly injective, but it is not surjective because given the element 1/2 ∈ R,
we have f(n) ≠ 1/2 for every n ∈ N. In fact, Theorem 14.2 of Section 14.1
asserts that there is no surjection N → R, and hence no bijection either.
Definition 14.4 yields

|N| < |R|. (14.1)


Said differently, ℵ₀ < |R|.

Is there a set X for which |R| < |X|? The answer is “yes.” The next
theorem implies |R| < |𝒫(R)|. (Recall that 𝒫(A) denotes the power set of A.)


Theorem 14.7 If A is any set, then |A| < |𝒫(A)|.


Proof. Before beginning the proof, we remark that this statement is obvious
if A is finite, for then |A| < 2^|A| = |𝒫(A)|. But our proof must apply to all
sets A, both finite and infinite, so it must use Definition 14.4.

We prove the theorem with direct proof. Let A be an arbitrary set.
According to Definition 14.4, to prove |A| < |𝒫(A)| we must show that there
is an injection f : A → 𝒫(A), but no bijection f : A → 𝒫(A).

To see that there is an injection f : A → 𝒫(A), define f by the rule
f(x) = {x}. In words, f sends any element x of A to the one-element set
{x} ∈ 𝒫(A). Then f : A → 𝒫(A) is injective, as follows. Suppose f(x) = f(y).
Then {x} = {y}. Now, the only way that {x} and {y} can be equal is if x = y,
so it follows that x = y. Thus f is injective.

Next we need to show that there exists no bijection A → 𝒫(A). We
will verify this by proving that there is no surjection A → 𝒫(A). Take an
arbitrary function f : A → 𝒫(A). To show f is not surjective we will produce
a set B ∈ 𝒫(A) for which f(a) ≠ B for all a ∈ A. Notice that for any element
x ∈ A, we have f(x) ∈ 𝒫(A), that is, f(x) ⊆ A. Thus f is a function sending
elements of A to subsets of A. It follows that for any x ∈ A, either x ∈ f(x) or
x ∉ f(x). Using this idea, define the following element B ∈ 𝒫(A):


B = {x ∈ A : x ∉ f(x)} ⊆ A.


Take an arbitrary a ∈ A. The following two cases show that f(a) ≠ B.


Case 1. If a ∉ f(a), then the definition of B implies a ∈ B. Consequently,
f(a) = B is impossible, for it would mean a ∉ B and a ∈ B.


Case 2. If a ∈ f(a), then the definition of B implies a ∉ B. Consequently,
f(a) = B is impossible, for it would mean a ∈ B and a ∉ B.


So f(a) ≠ B for all a ∈ A, and hence f is not surjective. As this holds for
any function f : A → 𝒫(A), there are no surjective functions f : A → 𝒫(A).
Consequently there are no such bijections either.

In conclusion, we have seen that there exists an injection A → 𝒫(A) but
no bijection A → 𝒫(A), so Definition 14.4 implies that |A| < |𝒫(A)|. ∎
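For a small finite set, the key step of this proof can be checked by brute force. The Python sketch below takes A = {1, 2, 3}, runs through every function f from A to its power set, and confirms that the set B = {x ∈ A : x ∉ f(x)} is never one of the values of f. The helper names power_set and diagonal_set are our own, and a function f is represented as a dictionary. This is only a sanity check on one example; the proof above is what covers all sets, finite and infinite.

```python
from itertools import combinations, product

def power_set(a):
    """Return all subsets of the finite set a, as frozensets."""
    items = sorted(a)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def diagonal_set(a, f):
    """Return B = {x in a : x not in f(x)} for a function f given as a dict."""
    return frozenset(x for x in a if x not in f[x])

A = {1, 2, 3}
subsets = power_set(A)

# Try every one of the 8**3 = 512 functions f : A -> P(A).
missed_every_time = all(
    diagonal_set(A, dict(zip(sorted(A), images))) not in images
    for images in product(subsets, repeat=len(A))
)
```

For instance, if f(1) = {1, 2}, f(2) = ∅ and f(3) = {1}, then B = {2, 3}, which is not among the three values of f.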


Beginning with the set A = N and applying Theorem 14.7 over and over
again, we get the following chain of infinite cardinalities.


ℵ₀ = |N| < |𝒫(N)| < |𝒫(𝒫(N))| < |𝒫(𝒫(𝒫(N)))| < ⋯ (14.2)


Thus there is an infinite sequence of different types of infinity, starting with
ℵ₀ and becoming ever larger. The set N is countable, and all the sets 𝒫(N),
𝒫(𝒫(N)), etc., are uncountable.

In the next section we will prove that |𝒫(N)| = |R|. Thus |N| and |R| are
the first two entries in the chain (14.2) above. They are just two relatively
tame infinities in a long list of other wild and exotic infinities.

Unless you plan on studying advanced set theory or the foundations of
mathematics, you are unlikely to ever encounter any types of infinity beyond
ℵ₀ and |R|. Still, in future mathematics courses you will need to distinguish
between countably infinite and uncountable sets, so we close with two final
theorems that can help you do this.


Theorem 14.8 An infinite subset of a countably infinite set is countably 
infinite. 


Proof. Suppose A is an infinite subset of the countably infinite set B. As B
is countably infinite, its elements can be written in a list b₁, b₂, b₃, b₄, ...
Then we can also write A’s elements in list form by proceeding through
the elements of B, in order, and selecting those that belong to A. Thus A
can be written in list form, and since A is infinite, its list will be infinite.
Consequently A is countably infinite. ∎
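The selection process in this proof amounts to filtering a list. As a small Python sketch, take B = N listed as 1, 2, 3, ... and let A be the set of primes, an infinite subset of B. The names sublist and is_prime are our own.

```python
from itertools import count, islice
from math import isqrt

def sublist(list_b, belongs_to_a):
    """Walk through the list of B in order, keeping the elements that lie in A."""
    return (b for b in list_b if belongs_to_a(b))

def is_prime(n):
    """Trial division; adequate for a small illustration."""
    return n > 1 and all(n % d for d in range(2, isqrt(n) + 1))

# B = N listed as 1, 2, 3, ...; A = the primes.
first_primes = list(islice(sublist(count(1), is_prime), 6))
```

Because A is infinite, the filtered list never runs out, which is exactly the point where the proof uses the hypothesis that A is infinite.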


Theorem 14.9 If U ⊆ A, and U is uncountable, then A is uncountable.


Proof. For the sake of contradiction say that U ⊆ A, and U is uncountable
but A is not uncountable. Since U ⊆ A and U is infinite, A must
be infinite too. Since A is infinite, and not uncountable, it must be countably
infinite. Then U is an infinite subset of a countably infinite set A, so U
is countably infinite by Theorem 14.8. Thus U is both uncountable and
countably infinite, a contradiction. ∎


Theorems 14.8 and 14.9 can be useful when we need to decide whether a 
set is countably infinite or uncountable. They sometimes allow us to decide 
its cardinality by comparing it to a set whose cardinality is known. 

For example, suppose we want to decide whether or not the set A = R² is
uncountable. Since the x-axis U = {(x,0) : x ∈ R} ⊆ R² has the same cardinality
as R, it is uncountable. Theorem 14.9 implies that R² is uncountable. Other
examples can be found in the exercises.


Exercises for Section 14.3 


1. Suppose B is an uncountable set and A is a set. Given that there is a surjective
function f : A → B, what can be said about the cardinality of A?
2. Prove that the set C of complex numbers is uncountable.
3. Prove or disprove: If A is uncountable, then |A| = |R|.
4. Prove or disprove: If A ⊆ B ⊆ C and A and C are countably infinite, then B is
countably infinite.
5. Prove or disprove: The set {0,1} × R is uncountable.
6. Prove or disprove: Every infinite set is a subset of a countably infinite set.
7. Prove or disprove: If A ⊆ B and A is countably infinite and B is uncountable,
then B − A is uncountable.
8. Prove or disprove: The set {(a₁, a₂, a₃, ...) : aᵢ ∈ Z} of infinite sequences of integers
is countably infinite.
9. Prove that if A and B are finite sets with |A| = |B|, then any injection f : A → B is
also a surjection. Show this is not necessarily true if A and B are not finite.
10. Prove that if A and B are finite sets with |A| = |B|, then any surjection f : A → B
is also an injection. Show this is not necessarily true if A and B are not finite.




14.4 The Cantor-Bernstein-Schröder Theorem


An often used property of numbers is that if a ≤ b and b ≤ a, then a = b. It
is reasonable to ask if the same property applies to cardinality. If |A| ≤ |B|
and |B| ≤ |A|, is it true that |A| = |B|? This is in fact true, and this section’s
goal is to prove it. This will yield an alternate (and highly effective) method
of proving that two sets have the same cardinality.

Recall (Definition 14.4) that |A| ≤ |B| means that there is an injection
f : A → B. Likewise, |B| ≤ |A| implies that there is an injection g : B → A.

Our aim is to show that if |A| ≤ |B| and |B| ≤ |A|, then |A| = |B|. In
other words, we aim to show that if there are injections f : A → B and
g : B → A, then there is a bijection h : A → B. The proof of this fact, though
not particularly difficult, is not entirely trivial, either. The fact that f and
g guarantee that such an h exists is called the Cantor-Bernstein-Schröder
theorem. This theorem is very useful for proving two sets A
and B have the same cardinality: it says that instead of finding a bijection
A → B, it suffices to find injections A → B and B → A. This is useful because
injections are often easier to find than bijections.

We will prove the Cantor-Bernstein-Schröder theorem, but before doing
so let’s work through an informal visual argument that will guide us through
(and illustrate) the proof.

Suppose there are injections f : A → B and g : B → A. We want to use
them to produce a bijection h : A → B. Sets A and B are sketched below.
For clarity, each has the shape of the letter that denotes it, and to help 
distinguish them the set A is shaded. 


Figure 14.3. The sets A and B 


The injections f : A → B and g : B → A are illustrated in Figure 14.4.
Think of f as putting a “copy” f(A) = {f(x) : x ∈ A} of A into B, as illustrated.
This copy, the range of f, does not fill up all of B (unless f happens to be
surjective). Likewise, g puts a “copy” g(B) of B into A. Because they are not
necessarily bijective, neither f nor g is guaranteed to have an inverse. But
the map g : B → g(B) from B to g(B) = {g(x) : x ∈ B} is bijective, so there is an
inverse g⁻¹ : g(B) → B. (We will need this inverse soon.)




Figure 14.4. The injections f : A → B and g : B → A


Consider the chain of injections illustrated in Figure 14.5. On the left, 
g puts a copy of B into A. Then f puts a copy of A (containing the copy of B) 
into B. Next, g puts a copy of this B-containing-A-containing-B into A, and 
so on, always alternating g and f. 


Figure 14.5. An infinite chain of injections 


Let’s analyze our infinite sequence B → A → B → A → B → A → ⋯

The first time A occurs in this sequence, it has a shaded region A − g(B).
In the second occurrence of A, the shaded region is (A − g(B)) ∪ (g∘f)(A − g(B)).
In the third occurrence of A, the shaded region is


(A − g(B)) ∪ (g∘f)(A − g(B)) ∪ (g∘f∘g∘f)(A − g(B)).


To tame the notation, let’s say (g∘f)² = (g∘f)∘(g∘f), and (g∘f)³ =
(g∘f)∘(g∘f)∘(g∘f), and so on. Let’s also agree that (g∘f)⁰ = i_A, that is, it
is the identity function on A. Then the shaded region of the nth occurrence
of A in the sequence is


⋃_{k=0}^{n−1} (g∘f)ᵏ(A − g(B)).


This process divides A into gray and white regions: the gray region is


G = ⋃_{k=0}^{∞} (g∘f)ᵏ(A − g(B)),


and the white region is A − G. (See Figure 14.6.)




Figure 14.6 suggests our desired bijection h : A → B. The injection f
sends the gray areas on the left bijectively to the gray areas on the right.
The injection g⁻¹ : g(B) → B sends the white areas on the left bijectively to
the white areas on the right. We can thus define h : A → B so that h(x) = f(x)
if x is a gray point, and h(x) = g⁻¹(x) if x is a white point.


Figure 14.6. The bijection h : A → B


This informal argument suggests that given injections f : A → B and
g : B → A, there is a bijection h : A → B. But it is not a proof. We now present
this as a theorem and tighten up our reasoning in a careful proof, with the
above diagrams and ideas as a guide.


Theorem 14.10 (The Cantor-Bernstein-Schröder Theorem)
If |A| ≤ |B| and |B| ≤ |A|, then |A| = |B|. In other words, if there are injections
f : A → B and g : B → A, then there is a bijection h : A → B.


Proof. (Direct) Suppose there are injections f : A → B and g : B → A. Then,
in particular, g : B → g(B) is a bijection from B onto the range of g, so it
has an inverse g⁻¹ : g(B) → B. (Note that g : B → A itself has no inverse
g⁻¹ : A → B unless g is surjective.) Consider the subset


G = ⋃_{k=0}^{∞} (g∘f)ᵏ(A − g(B)) ⊆ A.




Let W = A − G, so A = G ∪ W is partitioned into two sets G (think gray) and
W (think white). Define a function h : A → B as


h(x) = f(x)      if x ∈ G,
h(x) = g⁻¹(x)    if x ∈ W.


Notice that this makes sense: if x ∈ W, then x ∉ G, so x ∉ A − g(B) ⊆ G, hence
x ∈ g(B), so g⁻¹(x) is defined.

To finish the proof, we must show that h is both injective and surjective.

For injective, we assume h(x) = h(y), and deduce x = y. There are three
cases to consider. First, if x and y are both in G, then h(x) = h(y) means
f(x) = f(y), so x = y because f is injective. Second, if x and y are both in W,
then h(x) = h(y) means g⁻¹(x) = g⁻¹(y), and applying g to both sides gives
x = y. In the third case, one of x and y is in G and the other is in W. Say
x ∈ G and y ∈ W. The definition of G gives x = (g∘f)ᵏ(z) for some k ≥ 0 and
z ∈ A − g(B). Note h(x) = h(y) now implies f(x) = g⁻¹(y), that is, f((g∘f)ᵏ(z)) =
g⁻¹(y). Applying g to both sides gives (g∘f)ᵏ⁺¹(z) = y, which means y ∈ G.
But this is impossible, as y ∈ W. Thus this third case cannot happen. But in
the first two cases h(x) = h(y) implies x = y, so h is injective.

To see that h is surjective, take any b ∈ B. We will find an x ∈ A with
h(x) = b. Note that g(b) ∈ A, so either g(b) ∈ W or g(b) ∈ G. In the first case,
h(g(b)) = g⁻¹(g(b)) = b, so we have an x = g(b) ∈ A for which h(x) = b. In the
second case, g(b) ∈ G. The definition of G shows


g(b) = (g∘f)ᵏ(z)


for some z ∈ A − g(B) and k ≥ 0. In fact we have k > 0, because k = 0 would
give g(b) = (g∘f)⁰(z) = z ∈ A − g(B), but clearly g(b) ∉ A − g(B). Thus


g(b) = (g∘f)∘(g∘f)ᵏ⁻¹(z)
     = g(f((g∘f)ᵏ⁻¹(z))).


Because g is injective, this implies

b = f((g∘f)ᵏ⁻¹(z)).


Let x = (g∘f)ᵏ⁻¹(z), so x ∈ G by definition of G. Observe that h(x) = f(x) =
f((g∘f)ᵏ⁻¹(z)) = b. We have now seen that for any b ∈ B, there is an x ∈ A for
which h(x) = b. Thus h is surjective.

Since h : A → B is both injective and surjective, it is also bijective. ∎




Here are some examples illustrating how the Cantor-Bernstein-Schröder
theorem can be used. This includes a proof that |R| = |𝒫(N)|.


Example 14.6 The intervals [0,1) and (0,1) in R have equal cardinalities. 


Surely this fact is plausible, for the two intervals are identical except for
the endpoint 0. Yet concocting a bijection [0,1) → (0,1) is tricky. (Though not
particularly difficult: see the solution of Exercise 11 of Section 14.1.)

For a simpler approach, note that f(x) = 1/4 + (1/2)x is an injection [0,1) → (0,1).
Also, g(x) = x is an injection (0,1) → [0,1). The Cantor-Bernstein-Schröder
theorem guarantees a bijection h : [0,1) → (0,1), so |[0,1)| = |(0,1)|.
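For this example the bijection h from the proof of the Cantor-Bernstein-Schröder theorem can be written out explicitly. Taking f(x) = 1/4 + x/2 and g(x) = x, with A = [0,1) and B = (0,1), we get A − g(B) = {0}, so the gray set is G = {0, f(0), f(f(0)), ...} = {0, 1/4, 3/8, 7/16, ...}. The Python sketch below uses exact rational arithmetic and therefore handles rational inputs only, which is an assumption of the sketch; the name in_gray is our own.

```python
from fractions import Fraction

def f(x):
    """The injection [0,1) -> (0,1) given by f(x) = 1/4 + x/2."""
    return Fraction(1, 4) + x / 2

def in_gray(x):
    """Decide whether x lies in G = {0, f(0), f(f(0)), ...}."""
    if x >= Fraction(1, 2):
        return False          # the orbit of 0 stays below 1/2
    point = Fraction(0)
    while point < x:          # the orbit increases toward 1/2
        point = f(point)
    return point == x

def h(x):
    """The bijection [0,1) -> (0,1): f on G, and g^(-1)(x) = x elsewhere."""
    x = Fraction(x)
    if not 0 <= x < 1:
        raise ValueError("x must lie in [0, 1)")
    return f(x) if in_gray(x) else x
```

So h shifts each gray point to the next one (0 to 1/4, 1/4 to 3/8, and so on) and leaves every other point fixed, which is how the endpoint 0 is absorbed.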


Theorem 14.11 The sets R and 𝒫(N) have the same cardinality.


Proof. Example 14.4 shows that |R| = |(0,1)|, and Example 14.6 shows |(0,1)| =
|[0,1)|. Thus |R| = |[0,1)|, so to prove the theorem we just need to show that
|[0,1)| = |𝒫(N)|. By the Cantor-Bernstein-Schröder theorem, it suffices to
find injections f : [0,1) → 𝒫(N) and g : 𝒫(N) → [0,1).

To define f : [0,1) → 𝒫(N), we use the fact that any number in [0,1) has
a unique decimal representation 0.b₁b₂b₃b₄..., where each bᵢ is one of the
digits 0, 1, 2, ..., 9, and there is not a repeating sequence of 9’s at the end.
(Recall that, e.g., 0.359999... = 0.360000..., etc.) Define f : [0,1) → 𝒫(N) as


f(0.b₁b₂b₃b₄...) = {10b₁, 10²b₂, 10³b₃, ...} − {0}.


For example, f(0.121212...) = {10, 200, 1000, 20000, 100000, ...}, and f(0.05) =
{500}. Also f(0.5) = f(0.50) = {50}. To see that f is injective, take two
unequal numbers 0.b₁b₂b₃b₄... and 0.d₁d₂d₃d₄... in [0,1). Then bᵢ ≠ dᵢ
for some index i. Now, either bᵢ ≠ 0 or dᵢ ≠ 0; without loss of generality
say bᵢ ≠ 0. Then bᵢ10ⁱ ∈ f(0.b₁b₂b₃b₄...) but bᵢ10ⁱ ∉ f(0.d₁d₂d₃d₄...), so
f(0.b₁b₂b₃b₄...) ≠ f(0.d₁d₂d₃d₄...). Consequently f is injective.

Next we define g : 𝒫(N) → [0,1). For X ∈ 𝒫(N) (that is, for X ⊆ N), let
g(X) = 0.b₁b₂b₃b₄... be the base-10 decimal number for which bᵢ = 1 if i ∈ X
and bᵢ = 0 if i ∉ X. For example, g({1,3}) = 0.101000..., and g({2,4,6,8,...}) =
0.01010101... Also g(∅) = 0 and g(N) = 0.1111... Note that g is injective: If X ≠ Y,
then there is at least one integer i that belongs to one of X or Y, but not
the other. Consequently g(X) ≠ g(Y) because they differ in the ith decimal
place. This shows g is injective.

From the injections f : [0,1) → 𝒫(N) and g : 𝒫(N) → [0,1), the Cantor-
Bernstein-Schröder theorem guarantees a bijection h : [0,1) → 𝒫(N). Hence
|[0,1)| = |𝒫(N)|. As |R| = |[0,1)|, we conclude |R| = |𝒫(N)|. ∎
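The two injections in this proof can be tried out on simple inputs. The Python sketch below is restricted, as a stated assumption, to decimals with finitely many nonzero digits (passed as a string of digits after the decimal point) and to finite subsets of N, so that everything can be computed exactly.

```python
from fractions import Fraction

def f(digits):
    """Send 0.b1 b2 b3 ... to the set {10^i * b_i} with 0 removed.

    Assumption: the number is given by a finite digit string such as "05"
    for 0.05; all digits after the string are taken to be 0.
    """
    return {10 ** i * int(b) for i, b in enumerate(digits, start=1)} - {0}

def g(subset):
    """Send a finite set X of natural numbers to 0.b1 b2 b3 ...,
    where b_i = 1 if i is in X and b_i = 0 otherwise."""
    return sum((Fraction(1, 10 ** i) for i in subset), Fraction(0))
```

For instance, f("05") is {500} and f("5") and f("50") are both {50}, matching the examples in the proof, while g({1, 3}) is the fraction 101/1000, that is, 0.101.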




We know that |R| ≠ |N|. But we just proved |R| = |𝒫(N)|. This suggests
that the cardinality of R is not “too far” from |N| = ℵ₀. We close with a few
informal remarks on this mysterious relationship between ℵ₀ and |R|.

We established earlier in this chapter that ℵ₀ < |R|. For nearly a century
after Cantor formulated his theories on infinite sets, mathematicians
struggled with the question of whether or not there exists a set A for which


ℵ₀ < |A| < |R|.


It was commonly suspected that no such set exists, but no one was able to
prove or disprove this. The assertion that no such A exists came to be called
the continuum hypothesis.

Theorem 14.11 states that |R| = |𝒫(N)|. Placing this in the context of the
chain (14.2) on page 282, we have the following relationships.


 ℵ₀      |R|
 ‖        ‖
|N| < |𝒫(N)| < |𝒫(𝒫(N))| < |𝒫(𝒫(𝒫(N)))| < ⋯


From this, we can see that the continuum hypothesis asserts that no set
has a cardinality between that of N and its power set.

Though this may seem intuitively plausible, it eluded proof since Cantor 
first posed it in the 1880s. In fact, the real state of affairs seems almost 
paradoxical. In 1931, the logician Kurt Gödel proved that for any sufficiently
strong and consistent axiomatic system, there exist statements which can 
neither be proved nor disproved within the system. 

Later he proved that the negation of the continuum hypothesis cannot be 
proved within the standard axioms of set theory (i.e., the Zermelo-Fraenkel 
axioms, mentioned in Section 1.10). This meant that either the continuum 
hypothesis is false and cannot be proven false, or it is true. 

In 1964, Paul Cohen discovered another startling truth: Given the laws 
of logic and the axioms of set theory, no proof can deduce the continuum 
hypothesis. In essence he proved that the continuum hypothesis cannot be 
proved. 

Taken together, Gödel’s and Cohen’s results mean that the standard
axioms of mathematics cannot “decide” whether the continuum hypothesis 
is true or false, and that no logical conflict can arise from either asserting 
or denying the continuum hypothesis. We are free to either accept it as 
true or accept it as false, and the two choices lead to different—but equally 
consistent—versions of set theory. 




On the face of it, this seems to undermine the foundation of logic, and 
everything we have done in this book. The continuum hypothesis should be 
a statement—it should be either true or false. How could it be both? 

Here is an analogy that may help make sense of this. Consider the
number systems Zₙ. What if we asked whether [2] = [0] is true or false? Of
course the answer depends on n. The expression [2] = [0] is true in Z₂ and
false in Z₃. Moreover, if we assert that [2] = [0] is true, we are logically forced
to the conclusion that this is taking place in the system Z₂. If we assert
that [2] = [0] is false, then we are dealing with some other Zₙ. The fact that
[2] = [0] can be either true or false does not necessarily mean that there
is some inherent inconsistency within the individual number systems Zₙ.
The equation [2] = [0] is a true statement in the “universe” of Z₂ and a false
statement in the universe of (say) Z₃.

It is the same with the continuum hypothesis. Saying it’s true leads to
one system of set theory. Saying it’s false leads to some other system of set
theory. Gödel’s and Cohen’s discoveries mean that these two types of set
theory, although different, are equally consistent and valid mathematical
universes.

So what should you believe? Fortunately, it does not make much
difference, because most important mathematical results do not hinge on
the continuum hypothesis. (They are true in both universes.) Unless you 
undertake a deep study of the foundations of mathematics, you will be fine 
accepting the continuum hypothesis as true. Most mathematicians are 
agnostics on this issue, but they tend to prefer the version of set theory in 
which the continuum hypothesis holds. 

The situation with the continuum hypothesis is a testament to the 
immense complexity of mathematics. It is a reminder of the importance 
of rigor and careful, systematic methods of reasoning that begin with the 
ideas introduced in this book. 


Exercises for Section 14.4 

1. Show that if A ⊆ B and there is an injection g : B → A, then |A| = |B|.
2. Show that |R²| = |R|. Suggestion: Begin by showing |(0,1) × (0,1)| = |(0,1)|.
3. Let ℱ be the set of all functions N → {0,1}. Show that |R| = |ℱ|.
4. Let ℱ be the set of all functions R → {0,1}. Show that |R| < |ℱ|.
5. Consider the subset B = {(x,y) : x² + y² ≤ 1} ⊆ R². Show that |B| = |R²|.
6. Show that |𝒫(N × N)| = |𝒫(N)|.
7. Prove or disprove: If there is an injection f : A → B and a surjection g : A → B,
then there is a bijection h : A → B.




Conclusion 


If you have internalized the ideas in this book, then you have gained a set of
rhetorical tools for deciphering and communicating mathematics. These 
tools are indispensable at the advanced levels. But of course it takes more 
than mere tools to build something. Creativity, inspiration, skill, talent, 
intuition, passion, planning and persistence are also vitally important. It 
is safe to say that if you have come this far, then you probably possess a 
sufficient measure of these traits. 

The quest to understand mathematics has no end, but you are well 
equipped for the journey. It is my hope that the things you have learned 
from this book will lead you to a higher plane of understanding, creativity 
and expression. 

Good luck and best wishes. 


R.H. 


Solutions 


Chapter 1 Exercises 


Section 1.1 


1. {5x − 1 : x ∈ Z} = {..., −11, −6, −1, 4, 9, 14, 19, 24, 29, ...}
3. {x ∈ Z : −2 ≤ x < 7} = {−2, −1, 0, 1, 2, 3, 4, 5, 6}
5. {x ∈ R : x² = 3} = {−√3, √3}
7. {x ∈ R : x² + 5x = −6} = {−2, −3}
9. {x ∈ R : sin πx = 0} = {..., −2, −1, 0, 1, 2, 3, 4, ...} = Z
11. {x ∈ Z : |x| < 5} = {−4, −3, −2, −1, 0, 1, 2, 3, 4}
13. {x ∈ Z : |6x| < 5} = {0}
15. {5a + 2b : a, b ∈ Z} = {..., −2, −1, 0, 1, 2, 3, ...} = Z
17. {2, 4, 8, 16, 32, 64, ...} = {2ˣ : x ∈ N}
19. {..., −6, −3, 0, 3, 6, 9, 12, 15, ...} = {3x : x ∈ Z}
21. {0, 1, 4, 9, 16, 25, 36, ...} = {x² : x ∈ Z}
23. {3,4,5,
25. {..., 1/8, 1/4, 1/2, 1, 2, 4, 8, ...} = {2ⁿ : n ∈ Z}
27. {..., −π, −π/2, 0, π/2, π, 3π/2, 2π, 5π/2, ...} = {kπ/2 : k ∈ Z}
29. |{{1}, {2,{3,4}}, ∅}| = 3
33. |{x ∈ Z : |x| < 10}| = 19
35. |{x ∈ Z : x² < 10}| = 7
37. |{x ∈ N : x² < 0}| = 0
39. {(x,y) : x ∈ [1,2], y ∈ [1,2]}
41. {(x,y) : x ∈ [−1,1], y = 1}
43. {(x,y) : |x| = 2, y ∈ [0,1]}
45. {(x,y) : x, y ∈ R, x² + y² = 1}
47. {(x,y) : x, y ∈ R, y ≥ x² − 1}
49. {(x, x+y) : x ∈ R, y ∈ Z}
51. {(x,y) ∈ R² : (y − x)(y + x) = 0}

[Sketches of the sets in 39 through 51 on the x-y plane.]


Section 1.2 
1. Suppose A = {1,2,3,4} and B = {a,c}. 
(a) A × B = {(1,a), (1,c), (2,a), (2,c), (3,a), (3,c), (4,a), (4,c)}
(b) B × A = {(a,1), (a,2), (a,3), (a,4), (c,1), (c,2), (c,3), (c,4)}
(c) A × A = {(1,1), (1,2), (1,3), (1,4), (2,1), (2,2), (2,3), (2,4),
(3,1), (3,2), (3,3), (3,4), (4,1), (4,2), (4,3), (4,4)}
(d) B × B = {(a,a), (a,c), (c,a), (c,c)}
(e) ∅ × B = {(a,b) : a ∈ ∅, b ∈ B} = ∅ (There are no ordered pairs (a,b) with a ∈ ∅.)
(f) (A × B) × B =
{((1,a),a), ((1,c),a), ((2,a),a), ((2,c),a), ((3,a),a), ((3,c),a), ((4,a),a), ((4,c),a),
((1,a),c), ((1,c),c), ((2,a),c), ((2,c),c), ((3,a),c), ((3,c),c), ((4,a),c), ((4,c),c)}


(g) A × (B × B) =
{(1,(a,a)), (1,(a,c)), (1,(c,a)), (1,(c,c)),
(2,(a,a)), (2,(a,c)), (2,(c,a)), (2,(c,c)),
(3,(a,a)), (3,(a,c)), (3,(c,a)), (3,(c,c)),
(4,(a,a)), (4,(a,c)), (4,(c,a)), (4,(c,c))}


(h) B³ = {(a,a,a), (a,a,c), (a,c,a), (a,c,c), (c,a,a), (c,a,c), (c,c,a), (c,c,c)}
3. {x ∈ R : x² = 2} × {a,c,e} = {(−√2,a), (√2,a), (−√2,c), (√2,c), (−√2,e), (√2,e)}
5. {x ∈ R : x² = 2} × {x ∈ R : |x| = 2} = {(−√2,−2), (√2,2), (−√2,2), (√2,−2)}
7. {∅} × {0,∅} × {0,1} = {(∅,0,0), (∅,0,1), (∅,∅,0), (∅,∅,1)}
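
The products in Exercise 1 can be checked mechanically. The following is a minimal sketch, assuming Python's standard `itertools.product`; the helper name `cartesian` is made up for illustration. It also shows why (A × B) × B has nested pairs as elements rather than triples.

```python
from itertools import product

A = [1, 2, 3, 4]
B = ["a", "c"]

def cartesian(*sets):
    """Return the Cartesian product of the given finite sets as a set of tuples."""
    return set(product(*sets))

AxB = cartesian(A, B)            # part (a)
BxB = cartesian(B, B)            # part (d)
empty_x_B = cartesian([], B)     # part (e): no first coordinate is available
B3 = cartesian(B, B, B)          # part (h): ordered triples

# Part (f): elements have the form ((x, y), b), a pair whose first entry is a pair.
AxB_x_B = {(pair, b) for pair in AxB for b in B}

print(len(AxB), len(BxB), len(empty_x_B), len(B3), len(AxB_x_B))
```

Counting the elements this way agrees with the multiplication of cardinalities: |A × B| = 4 · 2, |B³| = 2 · 2 · 2 and |(A × B) × B| = 8 · 2.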


Sketch the following Cartesian products on the x-y plane. 


9. {1, 2, 3} × {−1, 0, 1}
11. [0, 1] × [0, 1]
13. {1, 1.5, 2} × [1, 2]
15. {1} × [0, 1]
17. N × Z
19. [0, 1] × [0, 1] × [0, 1]
[Figures: sketches of the six Cartesian products above.]


Section 1.3 


A. List all the subsets of the following sets. 
1. The subsets of {1,2,3,4} are: {}, {1}, {2}, {3}, {4}, {1,2}, {1,3}, {1,4}, {2,3}, {2,4},
{3,4}, {1,2,3}, {1,2,4}, {1,3,4}, {2,3,4}, {1,2,3,4}.
3. The subsets of {{R}} are: {} and {{R}}.
5. The subsets of {∅} are {} and {∅}.
7. The subsets of {R, {Q,N}} are {}, {R}, {{Q,N}}, {R, {Q,N}}.


B. Write out the following sets by listing their elements between braces.
9. {X : X ⊆ {3,2,a} and |X| = 2} = {{3,2}, {3,a}, {2,a}}
11. {X : X ⊆ {3,2,a} and |X| = 4} = {} = ∅


C. Decide if the following statements are true or false.
13. R³ ⊆ R³ is true because any set is a subset of itself.


15. {(x,y) : x − 1 = 0} ⊆ {(x,y) : x² − x = 0}. This is true. (The even-numbered ones
are both false. You have to explain why.)


Section 1.4 
A. Find the indicated sets. 
1. P({{a,b},{c}}) = {∅, {{a,b}}, {{c}}, {{a,b},{c}}}
3. P({{∅},5}) = {∅, {{∅}}, {5}, {{∅},5}}
5. P(P({2})) = {∅, {∅}, {{2}}, {∅,{2}}}
7. P({a,b}) × P({0,1}) =
{ (∅,∅), (∅,{0}), (∅,{1}), (∅,{0,1}),
({a},∅), ({a},{0}), ({a},{1}), ({a},{0,1}),
({b},∅), ({b},{0}), ({b},{1}), ({b},{0,1}),
({a,b},∅), ({a,b},{0}), ({a,b},{1}), ({a,b},{0,1}) }
9. P({a,b} × {0}) = {∅, {(a,0)}, {(b,0)}, {(a,0),(b,0)}}
11. {X ⊆ P({1,2,3}) : |X| ≤ 1} =
{∅, {∅}, {{1}}, {{2}}, {{3}}, {{1,2}}, {{1,3}}, {{2,3}}, {{1,2,3}}}
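
As a cross-check on Exercises 7 and 11, power sets of small sets can be enumerated directly. This is only an illustrative sketch; `power_set` is a hypothetical helper built on the standard `itertools` module, and `frozenset` is used because ordinary Python sets cannot be elements of other sets.

```python
from itertools import chain, combinations

def power_set(s):
    """Return the power set of the finite set s as a set of frozensets."""
    items = list(s)
    all_sizes = (combinations(items, r) for r in range(len(items) + 1))
    return {frozenset(c) for c in chain.from_iterable(all_sizes)}

P_ab = power_set({"a", "b"})
P_01 = power_set({0, 1})

# Exercise 7: ordered pairs (X, Y) with X in P({a,b}) and Y in P({0,1}).
pairs = {(X, Y) for X in P_ab for Y in P_01}

# Exercise 11: subsets X of P({1,2,3}) with at most one element.
small = {X for X in power_set(power_set({1, 2, 3})) if len(X) <= 1}

print(len(P_ab), len(pairs), len(small))
```

The counts match the lists above: 4 · 4 ordered pairs in Exercise 7, and the empty set plus 8 singletons in Exercise 11.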


B. Suppose that |A| =m and |B| =n. Find the following cardinalities: 


13. |P(P(P(A)))| = 2^(2^(2^m))

15. |P(A × B)| = 2^(mn)

17. |{X ∈ P(A) : |X| ≤ 1}| = m + 1

19. |P(P(P(A × ∅)))| = |P(P(P(∅)))| = 4
Section 1.5 


1. Suppose A = {4,3,6,7,1,9}, B = {5,6,8,4} and C = {5,8, 4}. Find: 
(a) A ∪ B = {1,3,4,5,6,7,8,9}
(b) A ∩ B = {4,6}
(c) A − B = {3,7,1,9}
(d) A − C = {3,6,7,1,9}
(e) B − A = {5,8}
(f) A ∩ C = {4}
(g) B ∩ C = {5,8,4}
(h) B ∪ C = {5,6,8,4}
(i) C − B = ∅
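
The answers to Exercise 1 can be reproduced with Python's built-in `set` type, whose operators `|`, `&` and `-` correspond to union, intersection and difference. This is a small sketch for checking the arithmetic, not part of the original solution.

```python
A = {4, 3, 6, 7, 1, 9}
B = {5, 6, 8, 4}
C = {5, 8, 4}

answers = {
    "(a) A ∪ B": A | B,
    "(b) A ∩ B": A & B,
    "(c) A − B": A - B,
    "(d) A − C": A - C,
    "(e) B − A": B - A,
    "(f) A ∩ C": A & C,
    "(g) B ∩ C": B & C,
    "(h) B ∪ C": B | C,
    "(i) C − B": C - B,
}

for label, value in answers.items():
    # sorted() only fixes the display order; sets themselves are unordered.
    print(label, "=", sorted(value))
```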


3. Suppose A = {0,1} and B = {1,2}. Find: 
(a) (A × B) ∩ (B × B) = {(1,1), (1,2)}
(b) (A × B) ∪ (B × B) = {(0,1), (0,2), (1,1), (1,2), (2,1), (2,2)}
(c) (A × B) − (B × B) = {(0,1), (0,2)}
(d) (A ∩ B) × A = {(1,0), (1,1)}
(e) (A × B) ∩ B = ∅
(f) P(A) ∩ P(B) = {∅, {1}}
(g) P(A) − P(B) = {{0}, {0,1}}
(h) P(A ∩ B) = {{}, {1}}
(i) P(A × B) = {∅, {(0,1)}, {(0,2)}, {(1,1)}, {(1,2)}, {(0,1),(0,2)}, {(0,1),(1,1)}, {(0,1),(1,2)}, {(0,2),(1,1)},
{(0,2),(1,2)}, {(1,1),(1,2)}, {(0,2),(1,1),(1,2)}, {(0,1),(1,1),(1,2)}, {(0,1),(0,2),(1,2)},
{(0,1),(0,2),(1,1)}, {(0,1),(0,2),(1,1),(1,2)}}


5. Sketch the sets X = [1,3] × [1,3] and Y = [2,4] × [2,4] on the plane R². On separate
drawings, shade in the sets X ∪ Y, X ∩ Y, X − Y and Y − X. (Hint: X and Y are
Cartesian products of intervals. You may wish to review how you drew sets like
[1,3] × [1,3] in Section 1.2.)
[Figures: four sketches shading X ∪ Y, X ∩ Y, X − Y and Y − X.]

7. Sketch the sets X = {(x,y) ∈ R² : x² + y² ≤ 1} and Y = {(x,y) ∈ R² : x ≥ 0} on R². On
separate drawings, shade in the sets X ∪ Y, X ∩ Y, X − Y and Y − X.
[Figures: sketches shading X ∪ Y, X ∩ Y, X − Y and Y − X.]




9. The first statement is true. (A picture should convince you; draw one if necessary.) 
The second statement is false: Notice for instance that (0.5,0.5) is in the right- 
hand set, but not the left-hand set. 


Section 1.6 
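1. Suppose A = {4,3,6,7,1,9} and B = {5,6,8,4} have universal set U = {n ∈ Z : 0 ≤ n ≤ 10}.


(a) $\overline{A}$ = {0,2,5,8,10}
(b) $\overline{B}$ = {0,1,2,3,7,9,10}
(c) $A \cap \overline{A}$ = ∅
(d) $A \cup \overline{A}$ = {0,1,2,3,4,5,6,7,8,9,10} = U
(e) $A - \overline{A}$ = A
(f) $A - \overline{B}$ = {4,6}
(g) $\overline{A} - \overline{B}$ = {5,8}
(h) $\overline{A} \cap B$ = {5,8}
(i) $\overline{\overline{A} \cap B}$ = {0,1,2,3,4,6,7,9,10}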


3. Sketch the set X = [1,3] × [1,2] on the plane R². On separate drawings, shade in
the sets $\overline{X}$ and $\overline{X} \cap ([0,2] \times [0,3])$.
[Figures: sketches of X, $\overline{X}$ and $\overline{X} \cap ([0,2] \times [0,3])$.]


5. Sketch the set X = {(x,y) ∈ R² : 1 ≤ x² + y² ≤ 4} on the plane R². On a separate
drawing, shade in the set $\overline{X}$.
[Figures: Solution of 1.6, #5. Solution of 1.7, #1.]


Section 1.7 
1. Draw a Venn diagram for $\overline{A}$ (solution above right).
3. Draw a Venn diagram for (A − B) ∩ C.
Scratch work is shown on the right. The set A − B is indicated with vertical shading.
The set C is indicated with horizontal shading. The intersection of A − B and C is
thus the overlapping region that is shaded with both vertical and horizontal lines.
The final answer is drawn on the far right, where the set (A − B) ∩ C is shaded in gray.
[Figures: scratch work and final Venn diagram for (A − B) ∩ C.]
5. Draw Venn diagrams for A ∪ (B ∩ C) and (A ∪ B) ∩ (A ∪ C). Based on your drawings,
do you think A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C)?
If you do the drawings carefully, you will find that your Venn diagrams are the same
for both A ∪ (B ∩ C) and (A ∪ B) ∩ (A ∪ C). Each looks as illustrated on the right.
Based on this, we are inclined to say that the equation A ∪ (B ∩ C) =
(A ∪ B) ∩ (A ∪ C) holds for all sets A, B and C.
[Figure: Venn diagram of A, B and C with A ∪ (B ∩ C) shaded.]
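
The Venn diagram comparison in Exercise 5 can be mimicked in code. A three-set Venn diagram has eight regions, one for each combination of "in A", "in B", "in C". The sketch below (an illustration, not the book's method) records for each region whether it is shaded for each side of the equation and compares the two shadings.

```python
from itertools import product

# Each region of the diagram is described by three flags: (in A, in B, in C).
regions = list(product([False, True], repeat=3))

# A region is shaded for A ∪ (B ∩ C) when it lies in A, or in both B and C.
left = [a or (b and c) for a, b, c in regions]

# A region is shaded for (A ∪ B) ∩ (A ∪ C) when it lies in A ∪ B and in A ∪ C.
right = [(a or b) and (a or c) for a, b, c in regions]

same_shading = left == right
print(same_shading)
```

Since membership in either set depends only on which region an element lies in, agreement on all eight regions supports the conjecture for arbitrary sets A, B and C.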


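7. Suppose sets A and B are in a universal set U. Draw Venn diagrams for $\overline{A \cap B}$
and $\overline{A} \cup \overline{B}$. Based on your drawings, do you think it’s true that
$\overline{A \cap B} = \overline{A} \cup \overline{B}$?


The diagrams for $\overline{A \cap B}$ and $\overline{A} \cup \overline{B}$ look exactly
alike. In either case the diagram is the shaded region illustrated on the right.
Thus we would expect that the equation $\overline{A \cap B} = \overline{A} \cup \overline{B}$
is true for any sets A and B.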
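
The expectation in Exercise 7 can be tested exhaustively on a small universal set. The following sketch assumes a made-up universe U = {0, 1, ..., 5} and a hypothetical helper `subsets`; it checks every pair of subsets, which is evidence for the identity rather than a proof for all sets.

```python
from itertools import chain, combinations

U = set(range(6))  # an arbitrary small universal set chosen for illustration

def subsets(s):
    """Yield every subset of the finite set s."""
    items = list(s)
    for combo in chain.from_iterable(combinations(items, r) for r in range(len(items) + 1)):
        yield set(combo)

def complement(X):
    """Complement of X relative to the universal set U."""
    return U - X

all_subsets = list(subsets(U))
holds = all(
    complement(A & B) == complement(A) | complement(B)
    for A in all_subsets
    for B in all_subsets
)
print(len(all_subsets) ** 2, holds)
```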


9. Venn diagram for (A ∩ B) − C:
[Figure: Venn diagram of A, B and C with the region (A ∩ B) − C shaded.]
11. The simplest answer is (B ∩ C) − A.
13. One answer is (A ∪ B ∪ C) − (A ∩ B ∩ C).


Section 1.8 


1. Suppose A₁ = {a,b,d,e,g,f}, A₂ = {a,b,c,d}, A₃ = {b,d,a} and A₄ = {a,b,h}.
(a) $\bigcup_{i=1}^{4} A_i$ = {a,b,c,d,e,f,g,h}    (b) $\bigcap_{i=1}^{4} A_i$ = {a,b}
3. For each n ∈ N, let Aₙ = {0,1,2,3,...,n}.
(a) $\bigcup_{i \in N} A_i$ = {0} ∪ N    (b) $\bigcap_{i \in N} A_i$ = {0,1}
5. (a) $\bigcup_{i \in N} [i, i+1]$ = [1,∞)    (b) $\bigcap_{i \in N} [i, i+1]$ = ∅
7. (a) $\bigcup_{i \in N} R \times [i, i+1]$ = {(x,y) : x, y ∈ R, y ≥ 1}    (b) $\bigcap_{i \in N} R \times [i, i+1]$ = ∅
9. (a) $\bigcup_{X \in P(N)} X$ = N    (b) $\bigcap_{X \in P(N)} X$ = ∅


11. Yes, this is always true. 
13. The first is true, the second is false. 


Chapter 2 Exercises 
Section 2.1 


1. Every real number is an even integer. (Statement, False) 
3. If x and y are real numbers and 5x = 5y, then x= y. (Statement, True) 


5. Sets Z and N are infinite. (Statement, True) 




7. The derivative of any polynomial of degree 5 is a polynomial of degree 6.
(Statement, False)


9. cos(x) = -1 
This is not a statement. It is an open sentence because whether it’s true or false 
depends on the value of x. 


11. The integer x is a multiple of 7. 
This is an open sentence, and not a statement. 


13. Either x is a multiple of 7, or it is not. 
This is a statement, for the sentence is true no matter what x is. 


15. In the beginning God created the heaven and the earth. 
This is a statement, for it is either definitely true or definitely false. There is 
some controversy over whether it’s true or false, but no one claims that it is 
neither true nor false. 


Section 2.2 


Express each statement as one of the forms P ∧ Q, P ∨ Q, or ∼P. Be sure to also
state exactly what statements P and Q stand for.


1. The number 8 is both even and a power of 2.
P ∧ Q
P: 8 is even
Q: 8 is a power of 2
Note: Do not say “Q: a power of 2,” because that is not a statement.


3. x ≠ y
∼(x = y) (Also ∼P where P: x = y.)
5. y ≥ x
∼(y < x) (Also ∼P where P: y < x.)
7. The number x equals zero, but the number y does not.
P ∧ ∼Q
P: x = 0
Q: y = 0
9. x ∈ A − B
(x ∈ A) ∧ ∼(x ∈ B)
11. A ∈ {X ∈ P(N) : |X| < ∞}
(A ⊆ N) ∧ (|A| < ∞).
13. Human beings want to be good, but not too good, and not all the time.
P ∧ ∼Q ∧ ∼R
P: Human beings want to be good.
Q: Human beings want to be too good.
R: Human beings want to be good all the time.


Section 2.3 


Without changing their meanings, convert each of the following sentences into a 
sentence having the form “If P, then Q.” 


1. A matrix is invertible provided that its determinant is not zero.
Answer: If a matrix has a determinant not equal to zero, then it is invertible.


3. For a function to be continuous, it is necessary that it is integrable.
Answer: If a function is continuous, then it is integrable.


5. An integer is divisible by 8 only if it is divisible by 4.
Answer: If an integer is divisible by 8, then it is divisible by 4.


7. A series converges whenever it converges absolutely.
Answer: If a series converges absolutely, then it converges.


9. A function is integrable provided the function is continuous.
Answer: If a function is continuous, then that function is integrable.


11. You fail only if you stop writing.
Answer: If you fail, then you have stopped writing.


13. Whenever people agree with me I feel I must be wrong.
Answer: If people agree with me, then I feel I must be wrong.


Section 2.4 


Without changing their meanings, convert each of the following sentences into a 
sentence having the form “P if and only if Q.” 


1. For a matrix to be invertible, it is necessary and sufficient that its determinant
is not zero.
Answer: A matrix is invertible if and only if its determinant is not zero.


3. If xy = 0 then x = 0 or y = 0, and conversely.
Answer: xy = 0 if and only if x = 0 or y = 0.


5. For an occurrence to become an adventure, it is necessary and sufficient for one
to recount it.
Answer: An occurrence becomes an adventure if and only if one recounts it.


Section 2.5 


1. Write a truth table for P ∨ (Q ⇒ R)

P | Q | R | Q ⇒ R | P ∨ (Q ⇒ R)
T | T | T |   T   |      T
T | T | F |   F   |      T
T | F | T |   T   |      T
T | F | F |   T   |      T
F | T | T |   T   |      T
F | T | F |   F   |      F
F | F | T |   T   |      T
F | F | F |   T   |      T

3. Write a truth table for ∼(P ⇒ Q)

P | Q | P ⇒ Q | ∼(P ⇒ Q)
T | T |   T   |    F
T | F |   F   |    T
F | T |   T   |    F
F | F |   T   |    F

5. Write a truth table for (P ∧ ∼P) ∨ Q

P | Q | (P ∧ ∼P) | (P ∧ ∼P) ∨ Q
T | T |    F     |      T
T | F |    F     |      F
F | T |    F     |      T
F | F |    F     |      F

7. Write a truth table for (P ∧ ∼P) ⇒ Q

P | Q | (P ∧ ∼P) | (P ∧ ∼P) ⇒ Q
T | T |    F     |      T
T | F |    F     |      T
F | T |    F     |      T
F | F |    F     |      T

9. Write a truth table for ∼(∼P ∨ ∼Q).

P | Q | ∼P | ∼Q | ∼P ∨ ∼Q | ∼(∼P ∨ ∼Q)
T | T | F  | F  |    F    |     T
T | F | F  | T  |    T    |     F
F | T | T  | F  |    T    |     F
F | F | T  | T  |    T    |     F
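
Truth tables like those in Exercises 1 through 9 can be generated by looping over every assignment of T and F. The sketch below is one way to do it; the helper names `implies` and `truth_table` are invented for this illustration, and rows are produced in the same order the tables above use (all T first, all F last).

```python
from itertools import product

def implies(p, q):
    """Conditional P ⇒ Q: false only when P is true and Q is false."""
    return (not p) or q

def truth_table(num_vars, formula):
    """Return a list of rows (value_1, ..., value_n, result)."""
    rows = []
    for values in product([True, False], repeat=num_vars):
        rows.append(values + (formula(*values),))
    return rows

# Exercise 1: P ∨ (Q ⇒ R)
table_1 = truth_table(3, lambda p, q, r: p or implies(q, r))

# Exercise 7: (P ∧ ∼P) ⇒ Q
table_7 = truth_table(2, lambda p, q: implies(p and not p, q))

for row in table_1:
    print(" | ".join("T" if value else "F" for value in row))
```

The final column of `table_7` is true in every row because the hypothesis P ∧ ∼P is never true.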


11. Suppose P is false and that the statement (R ⇒ S) ⇔ (P ∧ Q) is true. Find the
truth values of R and S. (This can be done without a truth table.)


Answer: Since P is false, it follows that (P ∧ Q) is false also. But then in order
for (R ⇒ S) ⇔ (P ∧ Q) to be true, it must be that (R ⇒ S) is false. The only way for
(R ⇒ S) to be false is if R is true and S is false.


Section 2.6
1. P ∧ (Q ∨ R) = (P ∧ Q) ∨ (P ∧ R)

P | Q | R | Q ∨ R | P ∧ Q | P ∧ R | P ∧ (Q ∨ R) | (P ∧ Q) ∨ (P ∧ R)
T | T | T |   T   |   T   |   T   |      T      |         T
T | T | F |   T   |   T   |   F   |      T      |         T
T | F | T |   T   |   F   |   T   |      T      |         T
T | F | F |   F   |   F   |   F   |      F      |         F
F | T | T |   T   |   F   |   F   |      F      |         F
F | T | F |   T   |   F   |   F   |      F      |         F
F | F | T |   T   |   F   |   F   |      F      |         F
F | F | F |   F   |   F   |   F   |      F      |         F

Thus since the columns agree, the two statements are logically equivalent.
3. P ⇒ Q = (∼P) ∨ Q

P | Q | ∼P | (∼P) ∨ Q | P ⇒ Q
T | T | F  |    T     |   T
T | F | F  |    F     |   F
F | T | T  |    T     |   T
F | F | T  |    T     |   T

Since the columns agree, the two statements are logically equivalent.


5. ∼(P ∨ Q ∨ R) = (∼P) ∧ (∼Q) ∧ (∼R)

P | Q | R | P ∨ Q ∨ R | ∼P | ∼Q | ∼R | ∼(P ∨ Q ∨ R) | (∼P) ∧ (∼Q) ∧ (∼R)
T | T | T |     T     | F  | F  | F  |      F       |         F
T | T | F |     T     | F  | F  | T  |      F       |         F
T | F | T |     T     | F  | T  | F  |      F       |         F
T | F | F |     T     | F  | T  | T  |      F       |         F
F | T | T |     T     | T  | F  | F  |      F       |         F
F | T | F |     T     | T  | F  | T  |      F       |         F
F | F | T |     T     | T  | T  | F  |      F       |         F
F | F | F |     F     | T  | T  | T  |      T       |         T

Since the columns agree, the two statements are logically equivalent.
7. P ⇒ Q = (P ∧ ∼Q) ⇒ (Q ∧ ∼Q)

P | Q | ∼Q | P ∧ ∼Q | Q ∧ ∼Q | (P ∧ ∼Q) ⇒ (Q ∧ ∼Q) | P ⇒ Q
T | T | F  |   F    |   F    |          T          |   T
T | F | T  |   T    |   F    |          F          |   F
F | T | F  |   F    |   F    |          T          |   T
F | F | T  |   F    |   F    |          T          |   T

Since the columns agree, the two statements are logically equivalent.
9. By DeMorgan’s law, we have ∼(∼P ∨ ∼Q) = ∼∼P ∧ ∼∼Q = P ∧ Q. Thus the two
statements are logically equivalent.
11. (∼P) ∧ (P ⇒ Q) and ∼(Q ⇒ P)

P | Q | ∼P | P ⇒ Q | Q ⇒ P | (∼P) ∧ (P ⇒ Q) | ∼(Q ⇒ P)
T | T | F  |   T   |   T   |       F        |    F
T | F | F  |   F   |   T   |       F        |    F
F | T | T  |   T   |   F   |       T        |    T
F | F | T  |   T   |   T   |       T        |    F

The columns for the two statements do not quite agree (they differ in the last
row), thus the two statements are not logically equivalent.
13. P ∨ (Q ∧ R) and (P ∨ Q) ∧ R are not logically equivalent because if P = T and
Q = R = F, then the first statement is true and the second is false.
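
Comparing columns, as done throughout this section, amounts to checking that two formulas agree on every assignment of truth values. Here is a minimal sketch of that check; `equivalent` and `implies` are hypothetical helper names, not notation from the text.

```python
from itertools import product

def implies(p, q):
    """Conditional P ⇒ Q: false only when P is true and Q is false."""
    return (not p) or q

def equivalent(f, g, num_vars):
    """True when formulas f and g have identical truth-table columns."""
    return all(
        f(*values) == g(*values)
        for values in product([True, False], repeat=num_vars)
    )

# Exercise 1: P ∧ (Q ∨ R) versus (P ∧ Q) ∨ (P ∧ R)
ex1 = equivalent(lambda p, q, r: p and (q or r),
                 lambda p, q, r: (p and q) or (p and r), 3)

# Exercise 11: (∼P) ∧ (P ⇒ Q) versus ∼(Q ⇒ P)
ex11 = equivalent(lambda p, q: (not p) and implies(p, q),
                  lambda p, q: not implies(q, p), 2)

# Exercise 13: P ∨ (Q ∧ R) versus (P ∨ Q) ∧ R
ex13 = equivalent(lambda p, q, r: p or (q and r),
                  lambda p, q, r: (p or q) and r, 3)

print(ex1, ex11, ex13)
```

A single disagreeing row is enough to rule out equivalence, which is exactly how Exercise 13 is settled with P = T and Q = R = F.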
Section 2.7 


Write the following as English sentences. Say if the statements are true or false. 


1. ∀x ∈ R, x² > 0
Answer: For every real number x, x² > 0.
Also: For every real number x, it follows that x² > 0.
Also: The square of any real number is positive. (etc.)
Statement is false. Reason: 0 is a real number, but it’s not true that 0² > 0.


3. ∃a ∈ R, ∀x ∈ R, ax = x.
Answer: There exists a real number a for which ax = x for every real number x.
This statement is TRUE. Reason: Consider a = 1.


5. ∀n ∈ N, ∃X ∈ P(N), |X| < n
Answer: For every natural number n, there is a subset X of N with |X| < n.
This statement is TRUE. Reason: Suppose n ∈ N. Let X = ∅. Then |X| = 0 < n.


7. ∀X ⊆ N, ∃n ∈ Z, |X| = n
Answer: For any subset X of N, there exists an integer n for which |X| = n.
This statement is FALSE. For example, the set X = {2,4,6,8,...} of all even natural
numbers is infinite, so there does not exist any integer n for which |X| = n.


9. ∀n ∈ Z, ∃m ∈ Z, m = n + 5
Answer: For every integer n there is another integer m such that m = n + 5.
This statement is TRUE.

Section 2.9 


Translate each of the following sentences into symbolic logic. 


1. If f is a polynomial and its degree is greater than 2, then f′ is not constant.
Translation: (P ∧ Q) ⇒ R, where
P: f is a polynomial,
Q: f has degree greater than 2,
R: f′ is not constant.


3. If x is prime then √x is not a rational number.
Translation: P ⇒ ∼Q, where
P: x is prime,
Q: √x is a rational number.


5. For every positive number ε, there is a positive number δ for which |x − a| < δ
implies |f(x) − f(a)| < ε.
Translation: ∀ε ∈ R, ε > 0, ∃δ ∈ R, δ > 0, (|x − a| < δ) ⇒ (|f(x) − f(a)| < ε)


7. There exists a real number a for which a + x = x for every real number x.
Translation: ∃a ∈ R, ∀x ∈ R, a + x = x


9. If x is a rational number and x ≠ 0, then tan(x) is not a rational number.
Translation: ((x ∈ Q) ∧ (x ≠ 0)) ⇒ (tan(x) ∉ Q)


11. There is a Providence that protects idiots, drunkards, children and the United
States of America.
One translation is as follows. Let R be the union of the set of idiots, the set of
drunkards, the set of children, and the set consisting of the USA. Let P be the
open sentence P(x): x is a Providence. Let S be the open sentence S(x,y): x
protects y. Then the translation is ∃x, ∀y ∈ R, P(x) ∧ S(x,y).
(Notice that, although this is mathematically correct, some humor has been lost
in the translation.)


13. Everything is funny as long as it is happening to somebody else.
Translation: ∀x, (∼M(x) ∧ S(x)) ⇒ F(x),
where M(x): x is happening to me, S(x): x is happening to someone, and F(x): x
is funny.


Section 2.10 


Negate the following sentences. 


1. The number x is positive, but the number y is not positive.
The “but” can be interpreted as “and.” Using DeMorgan’s law, the negation is:
The number x is not positive or the number y is positive.


3. For every prime number p, there is another prime number q with q > p.
Negation: There is a prime number p such that for every prime number q, q ≤ p.
Also: There exists a prime number p for which q ≤ p for every prime number q.
(etc.)


5. For every positive number ε there is a positive number M for which |f(x) − b| < ε
whenever x > M.
To negate this, it may be helpful to first write it in symbolic form. The statement
is ∀ε ∈ (0,∞), ∃M ∈ (0,∞), (x > M) ⇒ (|f(x) − b| < ε).
Working out the negation, we have

∼(∀ε ∈ (0,∞), ∃M ∈ (0,∞), (x > M) ⇒ (|f(x) − b| < ε)) =
∃ε ∈ (0,∞), ∼(∃M ∈ (0,∞), (x > M) ⇒ (|f(x) − b| < ε)) =
∃ε ∈ (0,∞), ∀M ∈ (0,∞), ∼((x > M) ⇒ (|f(x) − b| < ε)).

Finally, using the idea from Example 2.15, we can negate the conditional statement
that appears here to get

∃ε ∈ (0,∞), ∀M ∈ (0,∞), ∃x, (x > M) ∧ ∼(|f(x) − b| < ε).

Negation: There exists a positive number ε with the property that for every positive
number M, there is a number x for which x > M and |f(x) − b| ≥ ε.


7. I don’t eat anything that has a face.
Negation: I will eat some things that have a face.
(Note: If your answer was “I will eat anything that has a face.” then that is
wrong, both morally and mathematically.)


9. If sin(x) < 0, then it is not the case that 0 ≤ x ≤ π.
Negation: There exists a number x for which sin(x) < 0 and 0 ≤ x ≤ π.


11. You can fool all of the people all of the time.
There are several ways to negate this, including:
There is a person that you can’t fool all the time. or
There is a person x and a time y for which x is not fooled at time y.
(But Abraham Lincoln said it better.)




Chapter 3 Exercises 
Section 3.2 


1. Consider lists made from the letters T, H, E, O, R, Y, with repetition allowed.
(a) How many length-4 lists are there? Answer: 6 · 6 · 6 · 6 = 1296.
(b) How many length-4 lists are there that begin with T?
Answer: 1 · 6 · 6 · 6 = 216.
(c) How many length-4 lists are there that do not begin with T?
Answer: 5 · 6 · 6 · 6 = 1080.
3. How many ways can you make a list of length 3 from symbols A, B, C, D, E, F if...
(a) ... repetition is allowed. Answer: 6 · 6 · 6 = 216.
(b) ... repetition is not allowed. Answer: 6 · 5 · 4 = 120.
(c) ... repetition is not allowed and the list must contain the letter A.
Answer: 5 · 4 + 5 · 4 + 5 · 4 = 60.
(d) ... repetition is allowed and the list must contain the letter A.
Answer: 6 · 6 · 6 − 5 · 5 · 5 = 91.
(Note: See Example 3.3 if a more detailed explanation is required.)
5. This problem involves 8-digit binary strings such as 10011011 or 00001010 (i.e.,
8-digit numbers composed of 0’s and 1’s).
(a) How many such strings are there? Answer: 2 · 2 · 2 · 2 · 2 · 2 · 2 · 2 = 256.
(b) How many such strings end in 0? Answer: 2 · 2 · 2 · 2 · 2 · 2 · 2 · 1 = 128.
(c) How many such strings have the property that their second and fourth digits
are 1’s? Answer: 2 · 1 · 2 · 1 · 2 · 2 · 2 · 2 = 64.
(d) How many such strings are such that their second or fourth digits are 1’s?
Solution: These strings can be divided into three types. Type 1 consists of
those strings of form ∗1∗0∗∗∗∗, Type 2 consists of strings of form ∗0∗1∗∗∗∗,
and Type 3 consists of those of form ∗1∗1∗∗∗∗. By the multiplication principle
there are 2⁶ = 64 strings of each type, so there are 3 · 64 = 192 8-digit binary
strings whose second or fourth digits are 1’s.
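
Because there are only 256 strings in Exercise 5, each answer can be confirmed by listing them all and counting. This brute-force sketch is a check on the multiplication-principle reasoning, not a replacement for it.

```python
from itertools import product

# All 8-digit binary strings, e.g. "10011011".
strings = ["".join(bits) for bits in product("01", repeat=8)]

total = len(strings)                                                # part (a)
end_in_0 = sum(s.endswith("0") for s in strings)                    # part (b)
second_and_fourth = sum(s[1] == "1" and s[3] == "1" for s in strings)  # part (c)
second_or_fourth = sum(s[1] == "1" or s[3] == "1" for s in strings)    # part (d)

print(total, end_in_0, second_and_fourth, second_or_fourth)
```

Note that Python indexes from 0, so the second and fourth digits are `s[1]` and `s[3]`.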

7. This problem concerns 4-letter codes made from the letters A, B, C, D, ..., Z.
(a) How many such codes can be made? Answer: 26 · 26 · 26 · 26 = 456,976
(b) How many such codes have no two consecutive letters the same?
Solution: We use the multiplication principle. There are 26 choices for the first
letter. The second letter can’t be the same as the first letter, so there are only
25 choices for it. The third letter can’t be the same as the second letter, so there
are only 25 choices for it. The fourth letter can’t be the same as the third letter,
so there are only 25 choices for it. Thus there are 26 · 25 · 25 · 25 = 406,250
codes with no two consecutive letters the same.

9. A new car comes in a choice of five colors, three engine sizes and two transmissions.
How many different combinations are there? Answer: 5 · 3 · 2 = 30.




Section 3.3 


1. Five cards are dealt off of a standard 52-card deck and lined up in a row. How 
many such lineups are there that have at least one red card? 


Solution: All together there are 52 · 51 · 50 · 49 · 48 = 311,875,200 possible lineups.
The number of lineups that do not have any red cards (i.e. are made up only
of black cards) is 26 · 25 · 24 · 23 · 22 = 7,893,600. By the subtraction principle, the
answer to the question is 311,875,200 − 7,893,600 = 303,981,600.


How many such lineups are there in which the cards are all black or all hearts?


Solution: The number of lineups that are all black is 26 · 25 · 24 · 23 · 22 = 7,893,600.
The number of lineups that are hearts (which are red) is 13 · 12 · 11 · 10 · 9 = 154,440.
By the addition principle, the answer to the question is 7,893,600 + 154,440 =
8,048,040.


3. Five cards are dealt off of a standard 52-card deck and lined up in a row. How
many such lineups are there in which all 5 cards are of the same color (i.e., all
black or all red)?


Solution: There are 26 · 25 · 24 · 23 · 22 = 7,893,600 possible black-card lineups and
26 · 25 · 24 · 23 · 22 = 7,893,600 possible red-card lineups, so by the addition principle
the answer is 7,893,600 + 7,893,600 = 15,787,200.
5. How many integers between 1 and 9999 have no repeated digits?

Solution: Consider the 1-digit, 2-digit, 3-digit and 4-digit numbers separately.
The number of 1-digit numbers that have no repeated digits is 9 (i.e., all of
them). The number of 2-digit numbers that have no repeated digits is 9 · 9 = 81.
(The number can’t begin in 0, so there are only 9 choices for its first digit.) The
number of 3-digit numbers that have no repeated digits is 9 · 9 · 8 = 648. The
number of 4-digit numbers that have no repeated digits is 9 · 9 · 8 · 7 = 4536. By
the addition principle, the answer to the question is 9 + 81 + 648 + 4536 = 5274.


How many integers between 1 and 9999 have at least one repeated digit?


Solution: The total number of integers between 1 and 9999 is 9999. Using the
subtraction principle, we can subtract from this the number of integers that have
no repeated digits, which is 5274, as above. Therefore the answer to the question
is 9999 − 5274 = 4725.
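
Both counts in Exercise 5 are small enough to verify by testing every integer from 1 to 9999. The sketch below does so; the function name `has_repeated_digit` is an illustrative choice.

```python
def has_repeated_digit(n):
    """True when some digit occurs more than once in the decimal form of n."""
    digits = str(n)
    return len(set(digits)) < len(digits)

numbers = range(1, 10000)
no_repeat = sum(1 for n in numbers if not has_repeated_digit(n))
with_repeat = sum(1 for n in numbers if has_repeated_digit(n))

# The same total obtained from the addition principle, digit length by digit length.
by_formula = 9 + 9 * 9 + 9 * 9 * 8 + 9 * 9 * 8 * 7

print(no_repeat, by_formula, with_repeat)
```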


7. A password on a certain site must have five characters made from letters of the 
alphabet, and there must be at least one upper case letter. How many different 
passwords are there? 


Solution: Let U be the set of all possible passwords made from a choice of upper
and lower case letters. Let X be the set of all possible passwords made from
lower case letters. Then U − X is the set of passwords that have at least one upper
case letter. By the subtraction principle our answer will be |U − X| = |U| − |X|.
All together, there are 26 + 26 = 52 upper and lower case letters, so by the
multiplication principle |U| = 52 · 52 · 52 · 52 · 52 = 52⁵ = 380,204,032.
Likewise |X| = 26 · 26 · 26 · 26 · 26 = 26⁵ = 11,881,376.
Thus the answer is |U| − |X| = 380,204,032 − 11,881,376 = 368,322,656.


What if there must be a mix of upper and lower case?


Solution: The number of passwords using only upper case letters is 26⁵ =
11,881,376, and, as calculated above, this is also the number of passwords that
use only lower case letters. By the addition principle, the number of passwords
that use only lower case or only upper case is 11,881,376 + 11,881,376 = 23,762,752.
By the subtraction principle, the number of passwords that use a mix of upper
and lower case is the total number of possible passwords minus the number
that use only lower case or only upper case, namely 380,204,032 − 23,762,752 =
356,441,280.


9. This problem concerns lists of length 6 made from the letters A,B,C,D,E,F,G,H.
How many such lists are possible if repetition is not allowed and the list contains
two consecutive vowels?


Solution: There are just two vowels A and E to choose from. The lists we want
to make can be divided into five types. They have one of the forms VV∗∗∗∗,
or ∗VV∗∗∗, or ∗∗VV∗∗, or ∗∗∗VV∗, or ∗∗∗∗VV, where V indicates a
vowel and ∗ indicates a consonant. By the multiplication principle, there are
2 · 1 · 6 · 5 · 4 · 3 = 720 lists of form VV∗∗∗∗. In fact, for the same reason there
are 720 lists of each form. Thus by the addition principle, the answer to the
question is 720 + 720 + 720 + 720 + 720 = 3600.


11. How many integers between 1 and 1000 are divisible by 5? How many are not?
Solution: The integers that are divisible by 5 are 5, 10, 15, 20, ..., 995, 1000. There
are 1000/5 = 200 such numbers. By the subtraction principle, the number that
are not divisible by 5 is 1000 − 200 = 800.


Section 3.4


1. Answer: n = 14.
3. Answer: 5! = 120.
5. 120!/118! = 120 · 119 = 14,280.
7. Answer: 5!4! = 2880.
9. How many permutations of the letters A,B,C,D,E,F,G are there in which the
three letters ABC appear consecutively, in alphabetical order?


Solution: Regard ABC as a single symbol [ABC]. Then we are looking for the
number of permutations of the five symbols [ABC], D, E, F, G. The number of
such permutations is 5! = 120.


11. You deal 7 cards off of a 52-card deck and line them up in a row. How many
possible lineups are there in which not all cards are red?

Solution: All together, there are P(52,7) 7-card lineups with cards selected from
the entire deck. And there are P(26,7) 7-card lineups with red cards selected
from the 26 red cards in the deck. By the subtraction principle, the number of
lineups that are not all red is P(52,7) − P(26,7) = 670,958,870,400.


13. P(26,6) = 165,765,600    15. P(15,4) = 32,760    17. P(10,3) = 720
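
The permutation counts P(n,k) in Exercises 11 through 17 can be evaluated with `math.perm`, which is available in Python 3.8 and later. This is a quick arithmetic check under that assumption.

```python
import math

# Exercise 11: all 7-card lineups minus the lineups made only of red cards.
all_lineups = math.perm(52, 7)
all_red = math.perm(26, 7)
not_all_red = all_lineups - all_red

# Exercises 13, 15 and 17.
p_26_6 = math.perm(26, 6)
p_15_4 = math.perm(15, 4)
p_10_3 = math.perm(10, 3)

print(not_all_red, p_26_6, p_15_4, p_10_3)
```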




Section 3.5 


1. Suppose a set A has 37 elements. How many subsets of A have 10 elements?
How many subsets have 30 elements? How many have 0 elements?
Answers: $\binom{37}{10}$ = 348,330,136; $\binom{37}{30}$ = 10,295,472; $\binom{37}{0}$ = 1.


3. A set X has exactly 56 subsets with 3 elements. What is the cardinality of X?
Solution: The answer will be the n for which $\binom{n}{3}$ = 56. After some trial and error,
you will discover $\binom{8}{3}$ = 56, so |X| = 8.


5. How many 16-digit binary strings contain exactly seven 1’s?
Solution: Make such a string as follows. Start with a list of 16 blank spots.
Choose 7 of the blank spots for the 1’s and put 0’s in the other spots. There are
$\binom{16}{7}$ = 11,440 ways to do this.


7. |{X ∈ P({0,1,2,3,4,5,6,7,8,9}) : |X| < 4}| = $\binom{10}{0} + \binom{10}{1} + \binom{10}{2} + \binom{10}{3}$ = 1 + 10 + 45 + 120 = 176.
9. This problem concerns lists of length six made from the letters A,B,C,D,E,F,
without repetition. How many such lists have the property that the D occurs
before the A?

Solution: Make such a list as follows. Begin with six blank spaces and select two
of these spaces. Put the D in the first selected space and the A in the second.
There are $\binom{6}{2}$ = 15 ways of doing this. For each of these 15 choices there are
4! = 24 ways of filling in the remaining spaces. Thus the answer to the question
is 15 × 24 = 360 such lists.
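
Exercise 9 can be double-checked by listing all 6! = 720 arrangements of the letters and keeping those in which D comes before A. The sketch below compares that brute-force count with the product from the solution.

```python
from itertools import permutations
from math import comb, factorial

letters = "ABCDEF"

# Count arrangements where the position of D is left of the position of A.
brute_force = sum(
    1 for p in permutations(letters) if p.index("D") < p.index("A")
)

# Choose 2 of 6 spaces for D and A, then fill the other 4 spaces freely.
by_formula = comb(6, 2) * factorial(4)

print(brute_force, by_formula)
```

The answer is half of 720, which makes sense: swapping D and A pairs each valid list with exactly one invalid list.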


11. How many 10-digit integers contain no 0’s and exactly three 6’s?

Solution: Make such a number as follows: Start with 10 blank spaces and choose
three of these spaces for the 6’s. There are $\binom{10}{3}$ = 120 ways of doing this. For
each of these 120 choices we can fill in the remaining seven blanks with choices
from the digits 1,2,3,4,5,7,8,9, and there are 8⁷ ways to do this. Thus the answer to
the question is $\binom{10}{3} \cdot 8^7$ = 251,658,240.


13. Assume n, k ∈ Z with 0 ≤ k ≤ n. Then
$\binom{n}{k} = \frac{n!}{k!(n-k)!} = \frac{n!}{(n-k)!\,k!} = \frac{n!}{(n-k)!\,(n-(n-k))!} = \binom{n}{n-k}$.


15. How many 10-digit binary strings are there that do not have exactly four 1’s?
Solution: All together, there are 2¹⁰ different binary strings. The number of
10-digit binary strings with exactly four 1’s is $\binom{10}{4}$, because to make one we need
to choose 4 out of 10 positions for the 1’s and fill the rest in with 0’s. By the
subtraction principle, the answer to our question is $2^{10} - \binom{10}{4}$.


17. How many 10-digit binary numbers are there that have exactly four 1’s or exactly
five 1’s?

Solution: By the addition principle the answer is $\binom{10}{4} + \binom{10}{5}$.
How many do not have exactly four 1’s or exactly five 1’s?

Solution: By the subtraction principle combined with the answer to the first
part of this problem, the answer is $2^{10} - \binom{10}{4} - \binom{10}{5}$.


19. A 5-card poker hand is called a flush if all cards are the same suit. How many
different flushes are there?


Solution: There are $\binom{13}{5}$ = 1287 5-card hands that are all hearts. Similarly, there
are $\binom{13}{5}$ = 1287 5-card hands that are all diamonds, or all clubs, or all spades. By
the addition principle, there are then 1287 + 1287 + 1287 + 1287 = 5148 flushes.


Section 3.6 


1. Write out Row 11 of Pascal’s triangle.
Answer: 1 11 55 165 330 462 462 330 165 55 11 1


3. Use the binomial theorem to find the coefficient of x⁸ in (x + 2)¹³.
Answer: According to the binomial theorem, the x⁸y⁵ term of (x + y)¹³ is
$\binom{13}{8} x^8 y^5 = 1287 x^8 y^5$. Now plug in y = 2 to get the final answer of 41184x⁸.


5. Use the binomial theorem to show $\sum_{k=0}^{n} \binom{n}{k} = 2^n$. Hint: Observe that 2ⁿ = (1 + 1)ⁿ.
Now use the binomial theorem to work out (x + y)ⁿ and plug in x = 1 and y = 1.


7. Use the binomial theorem to show $\sum_{k=0}^{n} 3^k \binom{n}{k} = 4^n$.
Hint: Observe that 4ⁿ = (1 + 3)ⁿ. Now look at the hint for the previous problem.


9. Use the binomial theorem to show
$\binom{n}{0} - \binom{n}{1} + \binom{n}{2} - \binom{n}{3} + \binom{n}{4} - \binom{n}{5} + \cdots \pm \binom{n}{n} = 0$.
Hint: Observe that 0 = 0ⁿ = (1 + (−1))ⁿ. Now use the binomial theorem.


11. Use the binomial theorem to show $9^n = \sum_{k=0}^{n} (-1)^k \binom{n}{k} 10^{n-k}$.
Hint: Observe that 9ⁿ = (10 + (−1))ⁿ. Now use the binomial theorem.


13. Assume n ≥ 3. Then
$\binom{n}{3} = \binom{n-1}{3} + \binom{n-1}{2} = \binom{n-2}{3} + \binom{n-2}{2} + \binom{n-1}{2} = \cdots = \binom{2}{2} + \binom{3}{2} + \cdots + \binom{n-1}{2}$.
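
The identities in this section can be spot-checked numerically before proving them. The sketch below uses `math.comb` (Python 3.8 and later) and tests each identity for n up to 20; the range is an arbitrary choice for illustration, and a numerical check is of course not a proof.

```python
from math import comb

# Exercise 1: Row 11 of Pascal's triangle.
row_11 = [comb(11, k) for k in range(12)]

# Exercise 3: coefficient of x^8 in (x + 2)^13 is C(13, 8) * 2^5.
coeff_x8 = comb(13, 8) * 2 ** 5

def identities_hold(n):
    """Check the sums from Exercises 5, 7, 9 and 11 for one value of n >= 1."""
    ks = range(n + 1)
    return (
        sum(comb(n, k) for k in ks) == 2 ** n
        and sum(3 ** k * comb(n, k) for k in ks) == 4 ** n
        and sum((-1) ** k * comb(n, k) for k in ks) == 0
        and sum((-1) ** k * comb(n, k) * 10 ** (n - k) for k in ks) == 9 ** n
    )

all_hold = all(identities_hold(n) for n in range(1, 21))

# Exercise 13: C(n, 3) = C(2, 2) + C(3, 2) + ... + C(n - 1, 2) for n >= 3.
ex13_holds = all(
    comb(n, 3) == sum(comb(j, 2) for j in range(2, n)) for n in range(3, 21)
)

print(row_11, coeff_x8, all_hold, ex13_holds)
```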
Section 3.7 
1. At a certain university 523 of the seniors are history majors or math majors (or
both). There are 100 senior math majors, and 33 seniors are majoring in both
history and math. How many seniors are majoring in history?

Solution: Let A be the set of senior math majors and B be the set of senior
history majors. From |A ∪ B| = |A| + |B| − |A ∩ B| we get 523 = 100 + |B| − 33, so
|B| = 523 + 33 − 100 = 456. There are 456 history majors.


3. How many 4-digit positive integers are there that are even or contain no 0’s?
Solution: Let A be the set of 4-digit even positive integers, and let B be the set of
4-digit positive integers that contain no 0’s. We seek |A ∪ B|. By the multiplication
principle |A| = 9 · 10 · 10 · 5 = 4500. (Note the first digit cannot be 0 and the last
digit must be even.) Also |B| = 9 · 9 · 9 · 9 = 6561. Further, A ∩ B consists of all even
4-digit integers that have no 0’s. It follows that |A ∩ B| = 9 · 9 · 9 · 4 = 2916. Then the
answer to our question is |A ∪ B| = |A| + |B| − |A ∩ B| = 4500 + 6561 − 2916 = 8145.
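
The inclusion-exclusion count in Exercise 3 can be confirmed by building the two sets explicitly, since there are only 9000 four-digit integers. This is a brute-force sketch for checking the numbers.

```python
four_digit = range(1000, 10000)

A = {n for n in four_digit if n % 2 == 0}          # even
B = {n for n in four_digit if "0" not in str(n)}   # contains no 0's

union_size = len(A | B)
by_inclusion_exclusion = len(A) + len(B) - len(A & B)

print(len(A), len(B), len(A & B), union_size, by_inclusion_exclusion)
```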


5. How many 7-digit binary strings begin in 1 or end in 1 or have exactly four 1’s?
Solution: Let A be the set of such strings that begin in 1. Let B be the set of such
strings that end in 1. Let C be the set of such strings that have exactly four 1’s.
Then the answer to our question is |A ∪ B ∪ C|. Using Equation (3.5) to compute
this number, we have |A ∪ B ∪ C| = |A| + |B| + |C| − |A ∩ B| − |A ∩ C| − |B ∩ C| + |A ∩ B ∩ C| =
$2^6 + 2^6 + \binom{7}{4} - 2^5 - \binom{6}{3} - \binom{6}{3} + \binom{5}{2}$ = 64 + 64 + 35 − 32 − 20 − 20 + 10 = 101.
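
The three-set inclusion-exclusion computation in Exercise 5 can be verified by enumerating all 2⁷ = 128 strings. The sketch below builds A, B and C directly and compares the size of their union with the seven-term formula.

```python
from itertools import product

strings = ["".join(bits) for bits in product("01", repeat=7)]

A = {s for s in strings if s[0] == "1"}         # begin in 1
B = {s for s in strings if s[-1] == "1"}        # end in 1
C = {s for s in strings if s.count("1") == 4}   # exactly four 1's

union_size = len(A | B | C)
by_formula = (
    len(A) + len(B) + len(C)
    - len(A & B) - len(A & C) - len(B & C)
    + len(A & B & C)
)

print(union_size, by_formula)
```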


. This problem concerns 4-card hands dealt off of a standard 52-card deck. How 


many 4-card hands are there for which all four cards are of the same suit or all 
four cards are red? 


Richard Hammack Book of Proof 


11. 


13. 


15. 


309 


Solution: Let A be the set of 4-card hands for which all four cards are of the 
same suit. Let B be the set of 4-card hands for which all four cards are red. 
Then ANB is the set of 4-card hands for which the four cards are either all 
hearts or all diamonds. The answer to our question is |A UB| =|A|+|B|-|AnB| = 
4(13) + (78) —2( 33) = 2( 3) + (76) = 1430+ 14,950 = 16,380. 


. A 4-letter list is made from the letters L,I,S,T.E,D according to the following 


rule: Repetition is allowed, and the first two letters on the list are vowels or the 
list ends in D. How many such lists are possible? 

Solution: Let A be the set of such lists for which the first two letters are vowels, so 
|A| = 2-2-6-6 = 144. Let B be the set of such lists that end in D, so |B| = 6-6-6-1 = 216. 
Then ANB is the set of such lists for which the first two entries are vowels and 
the list ends in D. Thus |AnB|=2-2-6-1= 24. The answer to our question is 
|AUB|=|A|+|B|-|AnB| = 144+ 216-24 = 336. 


11. How many 7-digit numbers are even or have exactly three digits equal to 0?

Solution: Let $A$ be the set of 7-digit numbers that are even. By the multiplication
principle, $|A| = 9\cdot 10\cdot 10\cdot 10\cdot 10\cdot 10\cdot 5 = 4{,}500{,}000$. Let $B$ be the set of 7-digit numbers
that have exactly three digits equal to 0. Then $|B| = 9\cdot\binom{6}{3}\cdot 9\cdot 9\cdot 9$. (First digit is
anything but 0. Then choose 3 of the 6 remaining places in the number for
the 0’s. Finally the remaining 3 places can be anything but 0.)

Note $A \cap B$ is the set of 7-digit numbers that are even and contain exactly three
0’s. We can compute $|A \cap B|$ with the addition principle, by dividing $A \cap B$ into
two parts: the even 7-digit numbers with three digits 0 whose last digit is not
0, and the even 7-digit numbers with three digits 0 whose last digit is 0. The
first part has $9\cdot\binom{5}{3}\cdot 9\cdot 9\cdot 4$ elements. The second part has $9\cdot\binom{5}{2}\cdot 9\cdot 9\cdot 9\cdot 1$ elements.
Thus $|A \cap B| = 9\cdot\binom{5}{3}\cdot 9\cdot 9\cdot 4 + 9\cdot\binom{5}{2}\cdot 9\cdot 9\cdot 9$.

By the inclusion-exclusion formula, the answer to our question is
$|A \cup B| = |A| + |B| - |A \cap B| = 4{,}500{,}000 + 9^4\binom{6}{3} - 9^3\binom{5}{3}\cdot 4 - 9^4\binom{5}{2} = 4{,}536{,}450$.

13. How many 8-digit binary strings end in 1 or have exactly four 1’s?

Solution: Let $A$ be the set of strings that end in 1. By the multiplication principle
$|A| = 2^7$. Let $B$ be the set of strings with exactly four 1’s. Then $|B| = \binom{8}{4}$
because we can make such a string by choosing 4 of 8 spots for the 1’s and
filling the remaining spots with 0’s. Then $A \cap B$ is the set of strings that end
with 1 and have exactly four 1’s. Note that $|A \cap B| = \binom{7}{3}$ (make the last entry
a 1 and choose 3 of the remaining 7 spots for 1’s). By the inclusion-exclusion
formula, the number of 8-digit binary strings that end in 1 or have exactly four 1’s
is $|A \cup B| = |A| + |B| - |A \cap B| = 2^7 + \binom{8}{4} - \binom{7}{3} = 128 + 70 - 35 = 163$.
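Because only $2^8 = 256$ strings are involved, the total for this exercise is easy to confirm by enumeration. The sketch below compares a brute-force count with the inclusion-exclusion expression; the helper name `brute_force` is hypothetical, and `math.comb` assumes Python 3.8 or later.

```python
from itertools import product
from math import comb

def brute_force(length=8, ones=4):
    """Count binary strings that end in 1 or contain exactly `ones` 1's."""
    return sum(
        1
        for bits in product("01", repeat=length)
        if bits[-1] == "1" or bits.count("1") == ones
    )

# Inclusion-exclusion: |A| + |B| - |A ∩ B|
formula = 2**7 + comb(8, 4) - comb(7, 3)

print(brute_force(), formula)  # 163 163
```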

15. How many 10-digit binary strings begin in 1 or end in 1?

Solution: Let $A$ be the set of strings that begin with 1. By the multiplication
principle $|A| = 2^9$. Let $B$ be the set of strings that end with 1. By the
multiplication principle $|B| = 2^9$. Then $A \cap B$ is the set of strings that begin
and end with 1. By the multiplication principle $|A \cap B| = 2^8$. By the inclusion-exclusion
formula, the number of 10-digit binary strings that begin in 1 or end in 1 is
$|A \cup B| = |A| + |B| - |A \cap B| = 2^9 + 2^9 - 2^8 = 768$.




Section 3.8 


1. How many 10-element multisets can be made from the symbols $\{1, 2, 3, 4\}$?
Answer: $\binom{10+4-1}{10} = \binom{13}{10} = 286$.


3. You have a dollar in pennies, a dollar in nickels, a dollar in dimes and a dollar
in quarters. You give four coins to a friend. In how many ways can this be done?

Solution: In giving your friend four coins, you are giving her a 4-element multiset
made from elements in $\{1, 5, 10, 25\}$. There are $\binom{4+4-1}{4} = \binom{7}{4} = 35$ such multisets.


5. A bag contains 20 identical red balls, 20 identical blue balls, 20 identical green
balls, and one white ball. You reach in and grab 15 balls. How many different
outcomes are possible?

Solution: First we count the number of outcomes that don’t have a white ball.
Modeling this with stars and bars, we are looking at length-17 lists of the form

$\underbrace{\ast\ast\ast\cdots\ast}_{\text{red}} \;\big|\; \underbrace{\ast\ast\ast\cdots\ast}_{\text{blue}} \;\big|\; \underbrace{\ast\ast\ast\cdots\ast}_{\text{green}}$

where there are 15 stars and two bars. Therefore there are $\binom{17}{2}$ outcomes without
the white ball. Next we count the outcomes that do have the white ball. Then
there are 14 remaining balls in the grab. In counting the ways that they can
be selected we can use the same stars-and-bars model above, but this time the
list is of length 16 and has 14 stars. There are $\binom{16}{2}$ outcomes. Finally, by the
addition principle, the answer to the question is $\binom{17}{2} + \binom{16}{2} = 136 + 120 = 256$.
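The two stars-and-bars counts can be cross-checked by listing every possible grab as a tuple (red, blue, green, white). This is only an illustrative sketch; the function name and its parameters are assumptions, not notation from the text.

```python
def count_grabs(total=15, colored_limit=20):
    """Count outcomes (red, blue, green, white) with white in {0, 1} and `total` balls grabbed."""
    count = 0
    for white in (0, 1):
        rest = total - white  # balls that must be red, blue or green
        for red in range(rest + 1):
            for blue in range(rest - red + 1):
                green = rest - red - blue
                # The bag holds only `colored_limit` balls of each color.
                if max(red, blue, green) <= colored_limit:
                    count += 1
    return count

print(count_grabs())  # 256
```

With 15 balls grabbed and 20 of each color available, the limit never binds, which is why plain stars and bars applies.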


7. In how many ways can you place 20 identical balls into five different boxes?

Solution: Let’s model this with stars and bars. Doing this we get a list of length
24 with 20 stars and 4 bars, where the first grouping of stars has as many stars
as balls in Box 1, the second grouping has as many stars as balls in Box 2, and
so on.

$\underbrace{\ast\cdots\ast}_{\text{Box 1}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{Box 2}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{Box 3}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{Box 4}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{Box 5}}$

The number of ways to place 20 balls in the five boxes equals the number of such
lists, which is $\binom{24}{20} = 10{,}626$.


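9. A bag contains 50 pennies, 50 nickels, 50 dimes and 50 quarters. You reach in
and grab 30 coins. How many different outcomes are possible?

Solution: The stars-and-bars model is

$\underbrace{\ast\cdots\ast}_{\text{pennies}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{nickels}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{dimes}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{quarters}}$

with 30 stars and 3 bars, so there are $\binom{33}{30} = 5456$ outcomes.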


11. How many integer solutions does the equation $w + x + y + z = 100$ have if $w \ge 4$,
$x \ge 2$, $y \ge 0$ and $z \ge 0$?




Solution: Imagine a bag containing 100 red balls, 100 blue balls, 100 green balls
and 100 white balls. Each solution of the equation corresponds to an outcome in
selecting 100 balls from the bag, where the selection includes $w \ge 4$ red balls,
$x \ge 2$ blue balls, $y \ge 0$ green balls and $z \ge 0$ white balls.

Now let’s consider making such a selection. Pre-select 4 red balls and 2 blue
balls, so 94 balls remain to be selected. Next these remaining 94 balls are selected.
We can calculate the number of ways that this selection can be made with stars
and bars, where there are 94 stars and 3 bars, so the list’s length is 97.

$\underbrace{\ast\cdots\ast}_{\text{red}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{blue}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{green}} \;\big|\; \underbrace{\ast\cdots\ast}_{\text{white}}$

The number of outcomes is thus $\binom{97}{3} = 147{,}440$.
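The stars-and-bars answer can be compared with a direct count of solutions. The sketch below loops over $w$ and $x$ and counts the admissible $(y, z)$ pairs for each; the function name and defaults are illustrative assumptions.

```python
from math import comb

def count_solutions(total=100, w_min=4, x_min=2):
    """Count integer solutions of w + x + y + z = total with w >= w_min, x >= x_min, y, z >= 0."""
    count = 0
    for w in range(w_min, total + 1):
        for x in range(x_min, total - w + 1):
            # y may be 0, 1, ..., total - w - x, and z is then forced.
            count += total - w - x + 1
    return count

print(count_solutions(), comb(97, 3))  # 147440 147440
```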

13. How many length-6 lists can be made from the symbols $\{A, B, C, D, E, F, G\}$, if
repetition is allowed and the list is in alphabetical order?

Solution: Any such list corresponds to a 6-element multiset made from the
symbols $\{A, B, C, D, E, F, G\}$. For example, the list $AACDDG$ corresponds to the
multiset $[A, A, C, D, D, G]$. Thus the number of lists equals the number of multisets,
which is $\binom{6+7-1}{6} = \binom{12}{6} = 924$.

15. How many permutations are there of the letters in the word “TENNESSEE”?
Solution: By Fact 3.8, the answer is $\frac{9!}{4!\,2!\,2!\,1!} = 3{,}780$.

17. You roll a dice six times in a row. How many possible outcomes are there that
have two 1’s, three 5’s and one 6?

Solution: This is the number of permutations of the “word” ⚀⚀⚄⚄⚄⚅. By
Fact 3.8, the answer is $\frac{6!}{2!\,3!\,1!} = 60$.


19. In how many ways can you place 15 identical balls into 20 different boxes if each
box can hold at most one ball?

Solution: Regard each such distribution as a binary string of length 20, where
there is a 1 in the $i$th position precisely if the $i$th box contains a ball (and zeros
elsewhere). The answer is the number of permutations of such a string, which
by Fact 3.8 is $\frac{20!}{15!\,5!} = 15{,}504$. Alternatively, the answer is the number of ways to
choose 15 positions out of 20, which is $\binom{20}{15} = 15{,}504$.
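The three applications of Fact 3.8 above (TENNESSEE, the dice rolls, and the balls in boxes) all use the same quotient of factorials. A small sketch of that computation follows; the function name `multiset_permutations` is a made-up label, and the dice outcome is encoded here as the string "115556" purely for convenience.

```python
from collections import Counter
from math import factorial

def multiset_permutations(word):
    """Number of distinct arrangements of the symbols in `word` (n! over the product of multiplicities!)."""
    result = factorial(len(word))
    for multiplicity in Counter(word).values():
        result //= factorial(multiplicity)
    return result

print(multiset_permutations("TENNESSEE"))          # 3780
print(multiset_permutations("115556"))             # 60
print(multiset_permutations("1" * 15 + "0" * 5))   # 15504
```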

21. How many numbers between 10,000 and 99,999 contain one or more of the digits
3, 4 and 8, but no others?

Solution: First count the numbers that have three 3’s, one 4, and one 8, like
33,348. By Fact 3.8, the number of permutations of this is $\frac{5!}{3!\,1!\,1!} = 20$.

By the same reasoning there are 20 numbers that contain three 4’s, one 3, and
one 8, and 20 numbers that contain three 8’s, one 3, and one 4.

Next, consider the numbers that have two 3’s, two 4’s and one 8, like 33,448. By
Fact 3.8, the number of permutations of this is $\frac{5!}{2!\,2!\,1!} = 30$.

By the same reasoning there are 30 numbers that contain two 3’s, two 8’s and
one 4, and 30 numbers that contain two 4’s, two 8’s and one 3. This exhausts all
possibilities. By the addition principle the answer is $20 + 20 + 20 + 30 + 30 + 30 = 150$.
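The case analysis above reads the question as asking for 5-digit numbers in which each of 3, 4 and 8 appears at least once and no other digit appears. Under that reading, a direct scan of the range gives the same total. The function name below is illustrative.

```python
def count_numbers():
    """Count n in 10000..99999 whose set of digits is exactly {3, 4, 8}."""
    return sum(1 for n in range(10_000, 100_000) if set(str(n)) == {"3", "4", "8"})

print(count_numbers())  # 150
```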




Section 3.9 


1. Show that if 6 integers are chosen at random, at least two will have the same 
remainder when divided by 5. 


Solution: Pick six integers $n_1, n_2, n_3, n_4, n_5$ and $n_6$ at random. Imagine five boxes,
labeled Box 0, Box 1, Box 2, Box 3, Box 4. Each of the picked integers has a
remainder when divided by 5, and that remainder is 0, 1, 2, 3 or 4. For each $n_i$,
let $r_i$ be its remainder when divided by 5. Put $n_i$ in Box $r_i$. We have now put six
numbers in five boxes, so by the pigeonhole principle one of the boxes has two or 
more of the picked numbers in it. Those two numbers have the same remainder 
when divided by 5. 


3. What is the fewest number of times you must roll a six-sided dice before you can
be assured that 10 or more of the rolls resulted in the same number?

Solution: Imagine six boxes, labeled 1 through 6. Every time you roll a ⚀, put
an object in Box 1. Every time you roll a ⚁, put an object in Box 2, etc. After
$n$ rolls, the division principle says that one box contains at least $\lceil n/6 \rceil$ objects, and this
means you rolled the same number at least $\lceil n/6 \rceil$ times. We seek the smallest $n$ for which
$\lceil n/6 \rceil \ge 10$. This is the smallest $n$ for which $n/6 > 9$, that is $n > 9\cdot 6 = 54$. Thus the
answer is $n = 55$. You need to roll the dice 55 times.
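The search for the smallest $n$ with $\lceil n/6 \rceil \ge 10$ can be written out as a short loop. This is a sketch only; the function name and parameters are assumptions, and the ceiling is computed with integer arithmetic to avoid floating point.

```python
def fewest_rolls(faces=6, repeats=10):
    """Smallest n such that ceil(n / faces) >= repeats."""
    n = 1
    while (n + faces - 1) // faces < repeats:  # integer form of ceil(n / faces)
        n += 1
    return n

print(fewest_rolls())  # 55
```

The loop agrees with the closed form `faces * (repeats - 1) + 1`, which is one more than the largest number of rolls that can avoid the repetition.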
5. Prove that any set of 7 distinct natural numbers contains a pair of numbers
whose sum or difference is a multiple of 10.

Solution: Let $S$ be any set of 7 natural numbers. We want to show the sum or
difference of two of them is a multiple of 10. Take six boxes labeled as follows:

$\boxed{1,9}\quad\boxed{2,8}\quad\boxed{3,7}\quad\boxed{4,6}\quad\boxed{5}\quad\boxed{0}$

Notice that if a box has two numbers on it, then they add up to 10. Take the
numbers from $S$ and put them into the boxes in the following way: For each
$x \in S$, look at its rightmost (one’s) digit, and put it in the box that has that digit
on it. (For example, put 253 in box $\boxed{3,7}$. Also put 17 and 13 into $\boxed{3,7}$. Put 91 in
$\boxed{1,9}$; put 55 in $\boxed{5}$ and put 100 into $\boxed{0}$, etc.) Because $S$ has more elements than
there are boxes, at least one box will contain two (or more) numbers. Take two
numbers $x$ and $y$ that are in the same box. If their rightmost digits happen to be
the same, then the rightmost digit of the difference $x - y$ is 0, so this difference is
a multiple of 10. On the other hand, if their rightmost digits are different, then
because they are in the same box, their rightmost digits sum to 10. Therefore
the rightmost digit of the sum $x + y$ is 0, so the sum is a multiple of 10.
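The box argument is constructive, so it can be turned into a small search procedure. The sketch below files each number into one of the six boxes by its last digit and stops at the first collision; `BOXES` and `find_pair` are illustrative names, not from the text.

```python
import random

BOXES = [{1, 9}, {2, 8}, {3, 7}, {4, 6}, {5}, {0}]

def find_pair(numbers):
    """Return two entries whose sum or difference is a multiple of 10, or None if no box repeats."""
    seen = {}  # box index -> first number placed in that box
    for x in numbers:
        box = next(i for i, digits in enumerate(BOXES) if x % 10 in digits)
        if box in seen:
            return seen[box], x
        seen[box] = x
    return None

print(find_pair([253, 17, 91, 55, 100, 42, 68]))  # (253, 17), since 253 + 17 = 270

# Spot check on random 7-element sets of distinct natural numbers.
for _ in range(100):
    x, y = find_pair(random.sample(range(1, 10_000), 7))
    assert (x + y) % 10 == 0 or (x - y) % 10 == 0
```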


Section 3.10 
1. Show that $1(n-0) + 2(n-1) + 3(n-2) + 4(n-3) + \cdots + (n-1)2 + (n-0)1 = \binom{n+2}{3}$.


Solution: Let $S = \{0, 1, 2, 3, \ldots, n, n+1\}$, which is a set with $n+2$ elements. The
right-hand side $\binom{n+2}{3}$ of our equation is the number of 3-element subsets of $S$.

Let’s now count these 3-element subsets in a different way. Any such subset $X$
can be written as $X = \{j, k, \ell\}$, where $0 \le j < k < \ell \le n+1$. Note that this forces the




middle element $k$ to be in the range $1 \le k \le n$. Given a fixed middle element $k$,
there are $k$ choices for the smallest element $j$ and $n+1-k$ choices for the largest
element $\ell$.

$\underbrace{0,\ 1,\ 2,\ \ldots,\ k-1}_{k \text{ choices for } j},\ \underbrace{k}_{\text{middle}},\ \underbrace{k+1,\ k+2,\ k+3,\ \ldots,\ n,\ n+1}_{n+1-k \text{ choices for } \ell}$

By the multiplication principle, there are $k(n+1-k)$ possible 3-element sets
$X$ with middle element $k$. For example, if $k = 1$, there are $1(n-0)$ sets $X$ with
middle element 1. If $k = 2$, there are $2(n-1)$ sets $X$ with middle element 2. If
$k = 3$, there are $3(n-2)$ sets $X$ with middle element 3. Thus the left-hand side of
our equation counts up the number of 3-element subsets of $S$, so it is equal to
the right-hand side.
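A numerical spot check is no substitute for the combinatorial proof, but it helps confirm that the identity has been read correctly. The sketch below sums $k(n+1-k)$ over the possible middle elements and also counts the subsets by their middle element directly; the function names are illustrative.

```python
from itertools import combinations
from math import comb

def lhs(n):
    """Sum of k * (n + 1 - k) for k = 1, ..., n."""
    return sum(k * (n + 1 - k) for k in range(1, n + 1))

def subsets_with_middle(n, k):
    """Count 3-element subsets of {0, ..., n+1} whose middle element is k."""
    return sum(1 for j, mid, l in combinations(range(n + 2), 3) if mid == k)

for n in range(1, 12):
    assert lhs(n) == comb(n + 2, 3)
    assert all(subsets_with_middle(n, k) == k * (n + 1 - k) for k in range(1, n + 1))

print(lhs(10), comb(12, 3))  # 220 220
```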

3. Show that $\binom{n}{2}\binom{n-2}{k-2} = \binom{n}{k}\binom{k}{2}$.

Solution: Consider the following problem. From a group of $n$ people, you need to
select $k$ people to serve on a committee, and you also need to select 2 of these $k$
people to lead the committee’s discussion. In how many ways can this be done?

One approach is to first select $k$ people from $n$, and then select 2 of these $k$ people
to lead the discussion. By the multiplication principle, there are $\binom{n}{k}\binom{k}{2}$ ways to
make this selection.

Another approach is to first select 2 of the $n$ people to be the discussion leaders,
and there are $\binom{n}{2}$ ways to do this. Next we need to fill out the committee by
selecting $k-2$ people from the remaining $n-2$ people, and there are $\binom{n-2}{k-2}$ ways
to do this. By the multiplication principle, there are $\binom{n}{2}\binom{n-2}{k-2}$ ways to make the
selection.

By the previous two paragraphs, $\binom{n}{2}\binom{n-2}{k-2}$ and $\binom{n}{k}\binom{k}{2}$ are both answers to the same
counting problem, so they are equal.
5. Show that $\binom{2n}{2} = 2\binom{n}{2} + n^2$.

Solution: Let $S$ be a set with $2n$ elements. Then the left-hand side counts the
number of 2-element subsets of $S$.

Let’s now count this in a different way. Split $S$ as $S = A \cup B$, where $A$ and $B$ are disjoint and $|A| = n = |B|$. We
can choose a 2-element subset of $S$ in three ways: We could choose both elements
from $A$, and there are $\binom{n}{2}$ ways to do this. We could choose both elements from
$B$, and there are $\binom{n}{2}$ ways to do this. Or we could choose one element from $A$
and then another element from $B$, and by the multiplication principle there
are $n \cdot n = n^2$ ways to do this. Thus the number of 2-element subsets of $S$ is
$\binom{n}{2} + \binom{n}{2} + n^2 = 2\binom{n}{2} + n^2$, and this is the right-hand side. Therefore the equation
holds because both sides count the same thing.


7. Show that $\sum_{k=0}^{p} \binom{m}{k}\binom{n}{p-k} = \binom{m+n}{p}$.

Solution: Take three non-negative integers $m$, $n$ and $p$. Let $S$ be a set with
$|S| = m+n$, so the right-hand side counts the number of $p$-element subsets of $S$.




Now let’s count this in a different way. Split $S$ as $S = A \cup B$, where $|A| = m$ and
$|B| = n$. We can make any $p$-element subset of $S$ by choosing $k$ of its elements
from $A$ and $p-k$ of its elements from $B$, for any $0 \le k \le p$. There are $\binom{m}{k}$ ways
to choose $k$ elements from $A$, and $\binom{n}{p-k}$ ways to choose $p-k$ elements from $B$,
so there are $\binom{m}{k}\binom{n}{p-k}$ ways to make a $p$-element subset of $S$ that has $k$ elements
from $A$. As $k$ could be any number between 0 and $p$, the left-hand side of our
equation counts up the $p$-element subsets of $S$. Thus the left- and right-hand
sides count the same thing, so they are equal.
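The identity just proved can be spot-checked numerically for small values. This sketch relies on `math.comb` (Python 3.8 or later), which returns 0 when asked to choose more elements than are available, matching the convention the counting argument needs; the function name is an assumption.

```python
from math import comb

def split_count(m, n, p):
    """Sum over k of (ways to pick k from A) * (ways to pick p - k from B)."""
    return sum(comb(m, k) * comb(n, p - k) for k in range(p + 1))

for m in range(6):
    for n in range(6):
        for p in range(m + n + 1):
            assert split_count(m, n, p) == comb(m + n, p)

print(split_count(4, 5, 3), comb(9, 3))  # 84 84
```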


9. Show that $\sum_{k=m}^{n} \binom{k}{m} = \binom{n+1}{m+1}$.

Solution: Let $S = \{0, 1, 2, \ldots, n\}$, so $|S| = n+1$. The right-hand side of our equation
is the number of subsets $X$ of $S$ with $m+1$ elements.

Now let’s think of a way to make such an $X \subseteq S$ with $|X| = m+1$. We could begin
by selecting a largest element $k$ for $X$. Now, once we have chosen $k$, there are
$k$ elements in $S$ to the left of $k$, and we need to choose $m$ of them to go in $X$ (so
these, along with $k$, form the set $X$).

$S = \{\underbrace{0,\ 1,\ 2,\ 3,\ 4,\ 5,\ \ldots,\ k-1}_{\text{choose } m \text{ of these } k \text{ numbers for } X},\ \underbrace{k}_{\text{largest number in } X},\ k+1,\ k+2,\ k+3,\ \ldots,\ n\}$

There are $\binom{k}{m}$ ways to choose these $m$ numbers, so there are $\binom{k}{m}$ subsets of $S$
whose largest element is $k$. Notice that we must have $m \le k \le n$. (The largest
element $k$ of $X$ cannot be smaller than $m$ because we need at least $m$ elements
on its left.) Summing over all possible largest values in $X$, we see that $\sum_{k=m}^{n} \binom{k}{m}$
equals the number of subsets of $S$ with $m+1$ elements.

The previous two paragraphs show that $\sum_{k=m}^{n} \binom{k}{m}$ and $\binom{n+1}{m+1}$ are answers to the
same counting question, so they are equal.


11. Show that $\sum_{k=0}^{n} 2^k \binom{n}{k} = 3^n$.

Solution: Consider the problem of counting the number of length-$n$ lists made
from the symbols $\{a, b, c\}$, with repetition allowed. There are $3^n$ such lists, so the
right-hand side counts the number of such lists.

On the other hand, given $k$ with $0 \le k \le n$, let’s count the lists that have exactly $k$
entries unequal to $a$. There are $2^k\binom{n}{k}$ such lists. (First choose $k$ of $n$ list positions
to be filled with $b$ or $c$, in $\binom{n}{k}$ ways. Then fill these $k$ positions with $b$’s and $c$’s
in $2^k$ ways. Fill any remaining positions with $a$’s.) As $k$ could be any number
between 0 and $n$, the left-hand side of our equation counts up the number of
length-$n$ lists made from the symbols $\{a, b, c\}$. Thus the right- and left-hand sides
count the same thing, so they are equal.
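The grouping used in this argument (lists sorted by how many entries differ from $a$) can be reproduced by enumeration for a small $n$. The sketch below is illustrative only; `lists_by_non_a` is a made-up name and $n = 5$ is an arbitrary choice.

```python
from collections import Counter
from itertools import product
from math import comb

def lists_by_non_a(n):
    """Tally the length-n lists over {a, b, c} by their number of entries unequal to 'a'."""
    tally = Counter()
    for lst in product("abc", repeat=n):
        tally[n - lst.count("a")] += 1
    return tally

n = 5
tally = lists_by_non_a(n)
assert all(tally[k] == 2**k * comb(n, k) for k in range(n + 1))
assert sum(tally.values()) == 3**n
print(sorted(tally.items()))
```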




Chapter 4 Exercises 


1. If $x$ is an even integer, then $x^2$ is even.

Proof. Suppose $x$ is even. Thus $x = 2a$ for some $a \in \mathbb{Z}$.
Consequently $x^2 = (2a)^2 = 4a^2 = 2(2a^2)$.
Therefore $x^2 = 2b$, where $b$ is the integer $2a^2$.
Thus $x^2$ is even by definition of an even number. ∎


3. If $a$ is an odd integer, then $a^2 + 3a + 5$ is odd.

Proof. Suppose $a$ is odd.
Thus $a = 2c + 1$ for some integer $c$, by definition of an odd number.
Then $a^2 + 3a + 5 = (2c+1)^2 + 3(2c+1) + 5 = 4c^2 + 4c + 1 + 6c + 3 + 5 = 4c^2 + 10c + 9$
$= 4c^2 + 10c + 8 + 1 = 2(2c^2 + 5c + 4) + 1$.
This shows $a^2 + 3a + 5 = 2b + 1$, where $b = 2c^2 + 5c + 4 \in \mathbb{Z}$.
Therefore $a^2 + 3a + 5$ is odd. ∎


5. Suppose $x, y \in \mathbb{Z}$. If $x$ is even, then $xy$ is even.

Proof. Suppose $x, y \in \mathbb{Z}$ and $x$ is even.
Then $x = 2a$ for some integer $a$, by definition of an even number.
Thus $xy = (2a)(y) = 2(ay)$.
Therefore $xy = 2b$ where $b$ is the integer $ay$, so $xy$ is even. ∎


7. Suppose $a, b \in \mathbb{Z}$. If $a \mid b$, then $a^2 \mid b^2$.

Proof. Suppose $a \mid b$.
By definition of divisibility, this means $b = ac$ for some integer $c$.
Squaring both sides of this equation produces $b^2 = a^2c^2$.
Then $b^2 = a^2 d$, where $d = c^2 \in \mathbb{Z}$.
By definition of divisibility, this means $a^2 \mid b^2$. ∎


9. Suppose $a$ is an integer. If $7 \mid 4a$, then $7 \mid a$.


Proof. Suppose 7 | 4a. 

By definition of divisibility, this means 4a = 7c for some integer c. 

Since 4a = 2(2a) it follows that 4a is even, and since 4a = 7c, we know 7c is even. 
But then c can’t be odd, because that would make 7c odd, not even. 

Thus c is even, so c = 2d for some integer d. 

Now go back to the equation 4a = 7c and plug in c = 2d. We get 4a = 14d. 
Dividing both sides by 2 gives 2a = 7d. 

Now, since 2a = 7d, it follows that 7d is even, and thus d cannot be odd. 

Then d is even, so d = 2e for some integer e. 

Plugging d = 2e back into 2a = 7d gives 2a = 14e. 

Dividing both sides of 2a = 14e by 2 produces a = 7e. 

Finally, the equation $a = 7e$ means that $7 \mid a$, by definition of divisibility. ∎




11. Suppose $a, b, c, d \in \mathbb{Z}$. If $a \mid b$ and $c \mid d$, then $ac \mid bd$.

Proof. Suppose $a \mid b$ and $c \mid d$.
As $a \mid b$, the definition of divisibility means there is an integer $x$ for which $b = ax$.
As $c \mid d$, the definition of divisibility means there is an integer $y$ for which $d = cy$.
Since $b = ax$, we can multiply one side of $d = cy$ by $b$ and the other by $ax$.
This gives $bd = axcy$, or $bd = (ac)(xy)$.
Since $xy \in \mathbb{Z}$, the definition of divisibility applied to $bd = (ac)(xy)$ gives $ac \mid bd$. ∎


13. Suppose $x, y \in \mathbb{R}$. If $x^2 + 5y = y^2 + 5x$, then $x = y$ or $x + y = 5$.

Proof. Suppose $x^2 + 5y = y^2 + 5x$.
Then $x^2 - y^2 = 5x - 5y$, and factoring gives $(x-y)(x+y) = 5(x-y)$.
Now consider two cases.
Case 1. If $x - y \ne 0$ we can divide both sides of $(x-y)(x+y) = 5(x-y)$ by the non-zero
quantity $x - y$ to get $x + y = 5$.
Case 2. If $x - y = 0$, then $x = y$. (By adding $y$ to both sides.)
Thus $x = y$ or $x + y = 5$. ∎


15. If $n \in \mathbb{Z}$, then $n^2 + 3n + 4$ is even.

Proof. Suppose $n \in \mathbb{Z}$. We consider two cases.
Case 1. Suppose $n$ is even. Then $n = 2a$ for some $a \in \mathbb{Z}$.
Therefore $n^2 + 3n + 4 = (2a)^2 + 3(2a) + 4 = 4a^2 + 6a + 4 = 2(2a^2 + 3a + 2)$.
So $n^2 + 3n + 4 = 2b$ where $b = 2a^2 + 3a + 2 \in \mathbb{Z}$, so $n^2 + 3n + 4$ is even.
Case 2. Suppose $n$ is odd. Then $n = 2a + 1$ for some $a \in \mathbb{Z}$.
Therefore $n^2 + 3n + 4 = (2a+1)^2 + 3(2a+1) + 4 = 4a^2 + 4a + 1 + 6a + 3 + 4 = 4a^2 + 10a + 8$
$= 2(2a^2 + 5a + 4)$. So $n^2 + 3n + 4 = 2b$ where $b = 2a^2 + 5a + 4 \in \mathbb{Z}$, so $n^2 + 3n + 4$ is even.
In either case $n^2 + 3n + 4$ is even. ∎

17. If two integers have opposite parity, then their product is even.

Proof. Suppose $a$ and $b$ are two integers with opposite parity. Thus one is even
and the other is odd. Without loss of generality, suppose $a$ is even and $b$ is
odd. Therefore there are integers $c$ and $d$ for which $a = 2c$ and $b = 2d + 1$. Then
the product of $a$ and $b$ is $ab = 2c(2d+1) = 2(2cd + c)$. Therefore $ab = 2k$ where
$k = 2cd + c \in \mathbb{Z}$. Therefore the product $ab$ is even. ∎


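19. Suppose $a, b, c \in \mathbb{Z}$. If $a^2 \mid b$ and $b^3 \mid c$ then $a^6 \mid c$.

Proof. Since $a^2 \mid b$ we have $b = ka^2$ for some $k \in \mathbb{Z}$. Since $b^3 \mid c$ we have $c = hb^3$ for
some $h \in \mathbb{Z}$. Thus $c = h(ka^2)^3 = hk^3a^6$. Hence $a^6 \mid c$. ∎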


21. If $p$ is prime and $0 < k < p$ then $p \mid \binom{p}{k}$.

Proof. From the formula $\binom{p}{k} = \frac{p!}{(p-k)!\,k!}$ we get $p! = \binom{p}{k}(p-k)!\,k!$. Now, since the
prime number $p$ is a factor of $p!$ on the left, it must also be a factor of $\binom{p}{k}(p-k)!\,k!$
on the right. Thus the prime number $p$ appears in the prime factorization of
$\binom{p}{k}(p-k)!\,k!$.

As $k!$ is a product of numbers smaller than $p$, its prime factorization contains no
$p$’s. Similarly the prime factorization of $(p-k)!$ contains no $p$’s. But we noted that
the prime factorization of $\binom{p}{k}(p-k)!\,k!$ must contain a $p$, so the prime factorization
of $\binom{p}{k}$ contains a $p$. Thus $\binom{p}{k}$ is a multiple of $p$, so $p$ divides $\binom{p}{k}$. ∎
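The statement is easy to test for small primes, and a composite example shows why primality is needed. The following is a sketch under the assumption that trial division is an acceptable primality test at this scale; both helper names are illustrative.

```python
from math import comb

def is_prime(p):
    """Trial-division primality test; adequate for small p."""
    return p >= 2 and all(p % d != 0 for d in range(2, int(p**0.5) + 1))

def divides_all_middle_binomials(p):
    """True if p divides C(p, k) for every 0 < k < p."""
    return all(comb(p, k) % p == 0 for k in range(1, p))

for p in range(2, 60):
    if is_prime(p):
        assert divides_all_middle_binomials(p)

print(comb(4, 2) % 4)  # 2: the claim can fail when p is not prime
```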


23. If $n \in \mathbb{N}$ then $\binom{2n}{n}$ is even.

Proof. By definition, $\binom{2n}{n}$ is the number of $n$-element subsets of a set $A$ with $2n$
elements. For each subset $X \subseteq A$ with $|X| = n$, the complement $\overline{X}$ is a different
set, but it also has $2n - n = n$ elements. Imagine listing out all the $n$-element
subsets of the set $A$. It could be done in such a way that the list has form

$X_1, \overline{X_1}, X_2, \overline{X_2}, X_3, \overline{X_3}, X_4, \overline{X_4}, X_5, \overline{X_5}, \ldots$

This list has an even number of items, for they are grouped in pairs. Thus $\binom{2n}{n}$
is even. ∎


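25. If $a, b, c \in \mathbb{N}$ and $c \le b \le a$ then $\binom{a}{b}\binom{b}{c} = \binom{a}{b-c}\binom{a-b+c}{c}$.

Proof. Assume $a, b, c \in \mathbb{N}$ with $c \le b \le a$. Then we have
$\binom{a}{b}\binom{b}{c} = \frac{a!}{(a-b)!\,b!} \cdot \frac{b!}{(b-c)!\,c!} = \frac{a!}{(a-b)!\,(b-c)!\,c!} = \frac{a!}{(b-c)!\,(a-b+c)!} \cdot \frac{(a-b+c)!}{(a-b)!\,c!} = \binom{a}{b-c}\binom{a-b+c}{c}$. ∎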


27. Suppose $a, b \in \mathbb{N}$. If $\gcd(a,b) > 1$, then $b \mid a$ or $b$ is not prime.

Proof. Suppose $\gcd(a,b) > 1$. Let $c = \gcd(a,b) > 1$. Then since $c$ is a divisor of both
$a$ and $b$, we have $a = cx$ and $b = cy$ for integers $x$ and $y$. We divide into two cases
according to whether or not $b$ is prime.
Case I. Suppose $b$ is prime. Then the above equation $b = cy$ with $c > 1$ forces
$c = b$ and $y = 1$. Then $a = cx$ becomes $a = bx$, which means $b \mid a$. We conclude that
the statement “$b \mid a$ or $b$ is not prime” is true.
Case II. Suppose $b$ is not prime. Then the statement “$b \mid a$ or $b$ is not prime” is
automatically true. ∎


Chapter 5 Exercises 


1. Suppose $n \in \mathbb{Z}$. If $n^2$ is even, then $n$ is even.

Proof. (Contrapositive) Suppose $n$ is not even. Then $n$ is odd, so $n = 2a + 1$ for
some integer $a$, by definition of an odd number. Thus $n^2 = (2a+1)^2 = 4a^2 + 4a + 1 =
2(2a^2 + 2a) + 1$. Consequently $n^2 = 2b + 1$, where $b$ is the integer $2a^2 + 2a$, so $n^2$ is
odd. Therefore $n^2$ is not even. ∎




3. Suppose $a, b \in \mathbb{Z}$. If $a^2(b^2 - 2b)$ is odd, then $a$ and $b$ are odd.

Proof. (Contrapositive) Suppose it is not the case that $a$ and $b$ are odd. Then,
by DeMorgan’s law, at least one of $a$ and $b$ is even. Let us look at these cases
separately.
Case 1. Suppose $a$ is even. Then $a = 2c$ for some integer $c$. Thus $a^2(b^2 - 2b)
= (2c)^2(b^2 - 2b) = 2(2c^2(b^2 - 2b))$, which is even.
Case 2. Suppose $b$ is even. Then $b = 2c$ for some integer $c$. Thus $a^2(b^2 - 2b)
= a^2((2c)^2 - 2(2c)) = 2(a^2(2c^2 - 2c))$, which is even.
(A third case involving $a$ and $b$ both even is unnecessary, for either of the two
cases above covers this case.) Thus in either case $a^2(b^2 - 2b)$ is even, so it is not
odd. ∎


5. Suppose $x \in \mathbb{R}$. If $x^2 + 5x < 0$ then $x < 0$.

Proof. (Contrapositive) Suppose it is not the case that $x < 0$, so $x \ge 0$. Then
neither $x^2$ nor $5x$ is negative, so $x^2 + 5x \ge 0$. Thus it is not true that $x^2 + 5x < 0$. ∎


7. Suppose $a, b \in \mathbb{Z}$. If both $ab$ and $a + b$ are even, then both $a$ and $b$ are even.

Proof. (Contrapositive) Suppose it is not the case that both $a$ and $b$ are even.
Then at least one of them is odd. There are three cases to consider.
Case 1. Suppose $a$ is even and $b$ is odd. Then there are integers $c$ and $d$ for which
$a = 2c$ and $b = 2d + 1$. Then $ab = 2c(2d+1)$, which is even; and $a + b = 2c + 2d + 1 =
2(c+d) + 1$, which is odd. Thus it is not the case that both $ab$ and $a + b$ are even.
Case 2. Suppose $a$ is odd and $b$ is even. Then there are integers $c$ and $d$ for
which $a = 2c + 1$ and $b = 2d$. Then $ab = (2c+1)(2d) = 2(d(2c+1))$, which is even;
and $a + b = 2c + 1 + 2d = 2(c+d) + 1$, which is odd. Thus it is not the case that both
$ab$ and $a + b$ are even.
Case 3. Suppose $a$ is odd and $b$ is odd. Then there are integers $c$ and $d$ for
which $a = 2c + 1$ and $b = 2d + 1$. Then $ab = (2c+1)(2d+1) = 4cd + 2c + 2d + 1 =
2(2cd + c + d) + 1$, which is odd; and $a + b = 2c + 1 + 2d + 1 = 2(c + d + 1)$, which is even.
Thus it is not the case that both $ab$ and $a + b$ are even.
These cases show that it is not the case that $ab$ and $a + b$ are both even. (Note
that unlike Exercise 3 above, we really did need all three cases here, for each
case involved specific parities for both $a$ and $b$.) ∎


9. Suppose $n \in \mathbb{Z}$. If $3 \nmid n^2$, then $3 \nmid n$.

Proof. (Contrapositive) Suppose it is not the case that $3 \nmid n$, so $3 \mid n$. This means
that $n = 3a$ for some integer $a$. Consequently $n^2 = 9a^2$, from which we get $n^2 =
3(3a^2)$. This shows that there is an integer $b = 3a^2$ for which $n^2 = 3b$, which
means $3 \mid n^2$. Therefore it is not the case that $3 \nmid n^2$. ∎


11. Suppose $x, y \in \mathbb{Z}$. If $x^2(y+3)$ is even, then $x$ is even or $y$ is odd.

Proof. (Contrapositive) Suppose it is not the case that $x$ is even or $y$ is odd.
Using DeMorgan’s law, this means $x$ is not even and $y$ is not odd, which is to
say $x$ is odd and $y$ is even. Thus there are integers $a$ and $b$ for which $x = 2a + 1$
and $y = 2b$. Consequently $x^2(y+3) = (2a+1)^2(2b+3) = (4a^2 + 4a + 1)(2b + 3) =
8a^2b + 8ab + 2b + 12a^2 + 12a + 3 = 8a^2b + 8ab + 2b + 12a^2 + 12a + 2 + 1 =
2(4a^2b + 4ab + b + 6a^2 + 6a + 1) + 1$. This shows $x^2(y+3) = 2c + 1$ for $c = 4a^2b + 4ab +
b + 6a^2 + 6a + 1 \in \mathbb{Z}$. Consequently, $x^2(y+3)$ is not even. ∎


13. Suppose $x \in \mathbb{R}$. If $x^5 + 7x^3 + 5x \ge x^4 + x^2 + 8$, then $x \ge 0$.

Proof. (Contrapositive) Suppose it is not true that $x \ge 0$. Then $x < 0$, that is $x$ is
negative. Consequently, the expressions $x^5$, $7x^3$ and $5x$ are all negative (note the
odd powers) so $x^5 + 7x^3 + 5x < 0$. Similarly the terms $x^4$, $x^2$ and 8 are all positive
(note the even powers), so $0 < x^4 + x^2 + 8$. From this we get $x^5 + 7x^3 + 5x < x^4 + x^2 + 8$,
so it is not true that $x^5 + 7x^3 + 5x \ge x^4 + x^2 + 8$. ∎


15. Suppose $x \in \mathbb{Z}$. If $x^3 - 1$ is even, then $x$ is odd.

Proof. (Contrapositive) Suppose $x$ is not odd. Thus $x$ is even, so $x = 2a$ for some
integer $a$. Then $x^3 - 1 = (2a)^3 - 1 = 8a^3 - 1 = 8a^3 - 2 + 1 = 2(4a^3 - 1) + 1$. Therefore
$x^3 - 1 = 2b + 1$ where $b = 4a^3 - 1 \in \mathbb{Z}$, so $x^3 - 1$ is odd. Thus $x^3 - 1$ is not even. ∎


17. If $n$ is odd, then $8 \mid (n^2 - 1)$.

Proof. (Direct) Suppose $n$ is odd, so $n = 2a + 1$ for some integer $a$. Then $n^2 - 1 =
(2a+1)^2 - 1 = 4a^2 + 4a = 4(a^2 + a) = 4a(a+1)$. So far we have $n^2 - 1 = 4a(a+1)$, but
we want a factor of 8, not 4. But notice that one of $a$ or $a + 1$ must be even, so
$a(a+1)$ is even and hence $a(a+1) = 2c$ for some integer $c$. Now we have $n^2 - 1 =
4a(a+1) = 4(2c) = 8c$. But $n^2 - 1 = 8c$ means $8 \mid (n^2 - 1)$. ∎


19. Let $a, b, c \in \mathbb{Z}$ and $n \in \mathbb{N}$. If $a \equiv b \pmod{n}$ and $a \equiv c \pmod{n}$, then $c \equiv b \pmod{n}$.

Proof. (Direct) Suppose $a \equiv b \pmod{n}$ and $a \equiv c \pmod{n}$.
This means $n \mid (a - b)$ and $n \mid (a - c)$.
Thus there are integers $d$ and $e$ for which $a - b = nd$ and $a - c = ne$.
Subtracting the second equation from the first gives $c - b = nd - ne$.
Thus $c - b = n(d - e)$, so $n \mid (c - b)$ by definition of divisibility.
Therefore $c \equiv b \pmod{n}$ by definition of congruence modulo $n$. ∎


21. Let $a, b \in \mathbb{Z}$ and $n \in \mathbb{N}$. If $a \equiv b \pmod{n}$, then $a^3 \equiv b^3 \pmod{n}$.

Proof. (Direct) Suppose $a \equiv b \pmod{n}$. This means $n \mid (a - b)$, so there is an
integer $c$ for which $a - b = nc$. Then:

$a - b = nc$
$(a - b)(a^2 + ab + b^2) = nc(a^2 + ab + b^2)$
$a^3 + a^2b + ab^2 - ba^2 - ab^2 - b^3 = nc(a^2 + ab + b^2)$
$a^3 - b^3 = nc(a^2 + ab + b^2)$.

Since $a^2 + ab + b^2 \in \mathbb{Z}$, the equation $a^3 - b^3 = nc(a^2 + ab + b^2)$ implies $n \mid (a^3 - b^3)$,
and therefore $a^3 \equiv b^3 \pmod{n}$. ∎




23. Let $a, b \in \mathbb{Z}$ and $n \in \mathbb{N}$. If $a \equiv b \pmod{n}$, then $a^2 \equiv ab \pmod{n}$.

Proof. (Direct) Suppose $a \equiv b \pmod{n}$. This means $n \mid (a - b)$, so there is an
integer $d$ for which $a - b = nd$. Multiply both sides of this by $a$ to get $a^2 - ab = and$.
Consequently, there is an integer $e = da$ for which $a^2 - ab = ne$, so $n \mid (a^2 - ab)$ and
consequently $a^2 \equiv ab \pmod{n}$. ∎


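25. If $n \in \mathbb{N}$ and $2^n - 1$ is prime, then $n$ is prime.

Proof. (Contrapositive) Assume $n$ is not prime. Write $n = ab$ for some $a, b > 1$. Then $2^n - 1 =
2^{ab} - 1 = (2^b - 1)(2^{ab-b} + 2^{ab-2b} + 2^{ab-3b} + \cdots + 2^{ab-ab})$. Both factors are greater
than 1 because $a, b > 1$. Hence $2^n - 1$ is composite. ∎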
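The factorization used in this proof can be checked on concrete values. The sketch below builds the two factors for given $a$ and $b$ and confirms that their product is $2^{ab} - 1$; the function name `mersenne_factors` is a hypothetical label.

```python
def mersenne_factors(a, b):
    """Return the two factors of 2**(a*b) - 1 used in the proof, for a, b > 1."""
    n = a * b
    first = 2**b - 1
    # 2^(ab-b) + 2^(ab-2b) + ... + 2^(ab-ab)
    second = sum(2**(n - i * b) for i in range(1, a + 1))
    return first, second

for a in range(2, 8):
    for b in range(2, 8):
        first, second = mersenne_factors(a, b)
        assert first > 1 and second > 1
        assert first * second == 2**(a * b) - 1

print(mersenne_factors(3, 2))  # (3, 21), and 3 * 21 = 63 = 2**6 - 1
```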


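27. If $a \equiv 0 \pmod{4}$ or $a \equiv 1 \pmod{4}$ then $\binom{a}{2}$ is even.

Proof. We prove this directly. Assume $a \equiv 0 \pmod{4}$. Then $\binom{a}{2} = \frac{a(a-1)}{2}$. Since
$a = 4k$ for some $k \in \mathbb{N}$, we have $\binom{a}{2} = \frac{4k(4k-1)}{2} = 2k(4k-1)$. Hence $\binom{a}{2}$ is even.
Now assume $a \equiv 1 \pmod{4}$. Then $a = 4k + 1$ for some $k \in \mathbb{N}$. Hence $\binom{a}{2} = \frac{(4k+1)(4k)}{2} =
2k(4k+1)$. Hence $\binom{a}{2}$ is even. This proves the result. ∎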


29. If integers $a$ and $b$ are not both zero, then $\gcd(a, b) = \gcd(a - b, b)$.

Proof. (Direct) Suppose integers $a$ and $b$ are not both zero. Let $d = \gcd(a, b)$.
Because $d$ is a divisor of both $a$ and $b$, we have $a = dx$ and $b = dy$ for some
integers $x$ and $y$. Then $a - b = dx - dy = d(x - y)$, so it follows that $d$ is also a
common divisor of $a - b$ and $b$. Therefore it can’t be greater than the greatest
common divisor of $a - b$ and $b$, which is to say $\gcd(a, b) = d \le \gcd(a - b, b)$.

Now let $e = \gcd(a - b, b)$. Then $e$ divides both $a - b$ and $b$, that is, $a - b = ex$ and
$b = ey$ for integers $x$ and $y$. Then $a = (a - b) + b = ex + ey = e(x + y)$, so now we
see that $e$ is a divisor of both $a$ and $b$. Thus it is not more than their greatest
common divisor, that is, $\gcd(a - b, b) = e \le \gcd(a, b)$.

The above two paragraphs have given $\gcd(a, b) \le \gcd(a - b, b)$ and $\gcd(a - b, b) \le
\gcd(a, b)$. Thus $\gcd(a, b) = \gcd(a - b, b)$. ∎


31. Suppose the division algorithm applied to $a$ and $b$ yields $a = qb + r$. Then
$\gcd(a, b) = \gcd(r, b)$.

Proof. Suppose $a = qb + r$. Let $d = \gcd(a, b)$, so $d$ is a common divisor of $a$ and $b$;
thus $a = dx$ and $b = dy$ for some integers $x$ and $y$. Then $dx = a = qb + r = qdy + r$,
hence $dx = qdy + r$, and so $r = dx - qdy = d(x - qy)$. Thus $d$ is a divisor of $r$ (and
also of $b$), so $\gcd(a, b) = d \le \gcd(r, b)$.

On the other hand, let $e = \gcd(r, b)$, so $r = ex$ and $b = ey$ for some integers $x$ and $y$.
Then $a = qb + r = qey + ex = e(qy + x)$. Hence $e$ is a divisor of $a$ (and of course also
of $b$) so $\gcd(r, b) = e \le \gcd(a, b)$.

We’ve shown $\gcd(a, b) \le \gcd(r, b)$ and $\gcd(r, b) \le \gcd(a, b)$, so $\gcd(r, b) = \gcd(a, b)$. ∎
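The identity $\gcd(a, b) = \gcd(r, b)$ is the step that the Euclidean algorithm repeats until the remainder is 0. The sketch below is one common way to write that loop for positive integers; it is offered as an illustration of the result just proved, not as something taken from the text, and the function name `euclid` is an assumption.

```python
from math import gcd

def euclid(a, b):
    """Greatest common divisor of positive integers a and b via repeated division."""
    while b != 0:
        # With a = q*b + r, gcd(a, b) = gcd(r, b), so replace (a, b) by (b, r).
        a, b = b, a % b
    return a

print(euclid(1071, 462))  # 21
assert all(euclid(a, b) == gcd(a, b) for a in range(1, 60) for b in range(1, 60))
```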




Chapter 6 Exercises 


1. Suppose $n$ is an integer. If $n$ is odd, then $n^2$ is odd.

Proof. Suppose for the sake of contradiction that $n$ is odd and $n^2$ is not odd.
Then $n^2$ is even. Now, since $n$ is odd, we have $n = 2a + 1$ for some integer $a$.
Thus $n^2 = (2a+1)^2 = 4a^2 + 4a + 1 = 2(2a^2 + 2a) + 1$. This shows $n^2 = 2b + 1$, where
$b$ is the integer $b = 2a^2 + 2a$. Therefore we have $n^2$ is odd and $n^2$ is even, a
contradiction. ∎


3. Prove that $\sqrt[3]{2}$ is irrational.

Proof. Suppose for the sake of contradiction that $\sqrt[3]{2}$ is not irrational. Therefore
it is rational, so there exist integers $a$ and $b$ for which $\sqrt[3]{2} = \frac{a}{b}$. Let us assume that
this fraction is reduced, so $a$ and $b$ are not both even. Now we have $\left(\sqrt[3]{2}\right)^3 = \left(\frac{a}{b}\right)^3$,
which gives $2 = \frac{a^3}{b^3}$, or $2b^3 = a^3$. From this we see that $a^3$ is even, from which we
deduce that $a$ is even. (For if $a$ were odd, then $a^3 = (2c+1)^3 = 8c^3 + 12c^2 + 6c + 1 =
2(4c^3 + 6c^2 + 3c) + 1$ would be odd, not even.) Since $a$ is even, it follows that $a = 2d$
for some integer $d$. The equation $2b^3 = a^3$ from above then becomes $2b^3 = (2d)^3$,
or $2b^3 = 8d^3$. Dividing by 2, we get $b^3 = 4d^3$, and it follows that $b^3$ is even. Thus $b$
is even also. (Using the same argument we used when $a^3$ was even.) At this point
we have discovered that both $a$ and $b$ are even, contradicting the fact (observed
above) that $a$ and $b$ are not both even. ∎

Here is an alternative proof.

Proof. Suppose for the sake of contradiction that $\sqrt[3]{2}$ is not irrational. Therefore
there exist integers $a$ and $b$ for which $\sqrt[3]{2} = \frac{a}{b}$. Cubing both sides, we get $2 = \frac{a^3}{b^3}$.
From this, $a^3 = b^3 + b^3$, which contradicts Fermat’s last theorem. ∎


5. Prove that $\sqrt{3}$ is irrational.

Proof. Suppose for the sake of contradiction that $\sqrt{3}$ is not irrational. Therefore
it is rational, so there exist integers $a$ and $b$ for which $\sqrt{3} = \frac{a}{b}$. Let us assume
that this fraction is reduced, so $a$ and $b$ have no common factor. Notice that
$\left(\sqrt{3}\right)^2 = \left(\frac{a}{b}\right)^2$, so $3 = \frac{a^2}{b^2}$, or $3b^2 = a^2$. This means $3 \mid a^2$.

Now we are going to show that if $a \in \mathbb{Z}$ and $3 \mid a^2$, then $3 \mid a$. (This is a proof-within-a-proof.)
We will use contrapositive proof to prove this conditional statement.
Suppose $3 \nmid a$. Then there is a remainder of either 1 or 2 when 3 is divided into $a$.
Case 1. There is a remainder of 1 when 3 is divided into $a$. Then $a = 3m + 1$
for some integer $m$. Consequently, $a^2 = 9m^2 + 6m + 1 = 3(3m^2 + 2m) + 1$, and this
means 3 divides into $a^2$ with a remainder of 1. Thus $3 \nmid a^2$.
Case 2. There is a remainder of 2 when 3 is divided into $a$. Then $a = 3m + 2$ for
some integer $m$. Consequently, $a^2 = 9m^2 + 12m + 4 = 9m^2 + 12m + 3 + 1 = 3(3m^2 +
4m + 1) + 1$, and this means 3 divides into $a^2$ with a remainder of 1. Thus $3 \nmid a^2$.
In either case we have $3 \nmid a^2$, so we’ve shown $3 \nmid a$ implies $3 \nmid a^2$. Therefore, if $3 \mid a^2$,
then $3 \mid a$.




Now go back to $3 \mid a^2$ in the first paragraph. This combined with the result of
the second paragraph implies $3 \mid a$, so $a = 3d$ for some integer $d$. Now also in the
first paragraph we had $3b^2 = a^2$, which now becomes $3b^2 = (3d)^2$ or $3b^2 = 9d^2$, so
$b^2 = 3d^2$. But this means $3 \mid b^2$, and the second paragraph implies $3 \mid b$. Thus we
have concluded that $3 \mid a$ and $3 \mid b$, but this contradicts the fact that the fraction
$\frac{a}{b}$ is reduced. ∎


7. If $a, b \in \mathbb{Z}$, then $a^2 - 4b - 3 \neq 0$.

Proof. Suppose for the sake of contradiction that $a, b \in \mathbb{Z}$ but $a^2 - 4b - 3 = 0$. Then
we have $a^2 = 4b + 3 = 2(2b+1) + 1$, which means $a^2$ is odd. Therefore $a$ is odd also,
so $a = 2c+1$ for some integer $c$. Plugging this back into $a^2 - 4b - 3 = 0$ gives us

\begin{align*}
(2c+1)^2 - 4b - 3 &= 0 \\
4c^2 + 4c + 1 - 4b - 3 &= 0 \\
4c^2 + 4c - 4b &= 2 \\
2c^2 + 2c - 2b &= 1 \\
2(c^2 + c - b) &= 1.
\end{align*}

From this last equation, we see that 1 is an even number, a contradiction. ∎


9. Suppose $a, b \in \mathbb{R}$ and $a \neq 0$. If $a$ is rational and $ab$ is irrational, then $b$ is irrational.

Proof. Suppose for the sake of contradiction that $a$ is rational and $ab$ is irrational
and $b$ is not irrational. Thus we have $a$ and $b$ rational, and $ab$ irrational. Since $a$
and $b$ are rational, we know there are integers $c, d, e, f$ for which $a = \frac{c}{d}$ and $b = \frac{e}{f}$.
Then $ab = \frac{ce}{df}$, and since both $ce$ and $df$ are integers, it follows that $ab$ is rational.
But this is a contradiction because we started out with $ab$ irrational. ∎


11. There exist no integers $a$ and $b$ for which $18a + 6b = 1$.

Proof. Suppose for the sake of contradiction that there do exist integers $a$ and $b$
with $18a + 6b = 1$. Then $1 = 2(9a + 3b)$, which means 1 is even, a contradiction. ∎


13. For every $x \in [\pi/2, \pi]$, $\sin x - \cos x \ge 1$.

Proof. Suppose for the sake of contradiction that $x \in [\pi/2, \pi]$, but $\sin x - \cos x < 1$.
Since $x \in [\pi/2, \pi]$, we know $\sin x \ge 0$ and $\cos x \le 0$, so $\sin x - \cos x \ge 0$. Therefore
we have $0 \le \sin x - \cos x < 1$. Now the square of any number between 0 and
1 is still a number between 0 and 1, so we have $0 \le (\sin x - \cos x)^2 < 1$, or
$0 \le \sin^2 x - 2\sin x \cos x + \cos^2 x < 1$. Using the fact that $\sin^2 x + \cos^2 x = 1$, this becomes
$0 \le -2\sin x \cos x + 1 < 1$. Subtracting 1, we obtain $-2\sin x \cos x < 0$. But above we
remarked that $\sin x \ge 0$ and $\cos x \le 0$, and hence $-2\sin x \cos x \ge 0$. We now have the
contradiction $-2\sin x \cos x < 0$ and $-2\sin x \cos x \ge 0$. ∎




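15. If $b \in \mathbb{Z}$ and $b \nmid k$ for every $k \in \mathbb{N}$, then $b = 0$.

Proof. Suppose for the sake of contradiction that $b \in \mathbb{Z}$ and $b \nmid k$ for every $k \in \mathbb{N}$,
but $b \neq 0$.

Case 1. Suppose $b > 0$. Then $b \in \mathbb{N}$, so $b \mid b$, contradicting $b \nmid k$ for every $k \in \mathbb{N}$.
Case 2. Suppose $b < 0$. Then $-b \in \mathbb{N}$, so $b \mid (-b)$, again a contradiction. ∎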


17. For every $n \in \mathbb{Z}$, $4 \nmid (n^2 + 2)$.

Proof. Assume there exists $n \in \mathbb{Z}$ with $4 \mid (n^2 + 2)$. Then for some $k \in \mathbb{Z}$, $4k = n^2 + 2$
or $2k = n^2 + 2(1-k)$. If $n$ is odd, this means $2k$ is odd, and we've reached a
contradiction. If $n$ is even then $n = 2j$ and we get $k = 2j^2 + 1 - k$ for some $j \in \mathbb{Z}$.
Hence $2(k - j^2) = 1$, so 1 is even, a contradiction. ∎


Remark. It is fairly easy to see that two more than a perfect square is always 
either 2 (mod 4) or 3 (mod 4). This would end the proof immediately. 
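
The remark can be spot-checked numerically. The Python sketch below collects the remainders of n² + 2 modulo 4 over a finite range; the helper name `residues_mod4` and the range are illustrative choices, not part of the text, and a finite check is only evidence, not a proof.

```python
def residues_mod4(lo, hi):
    """Return the set of remainders of n*n + 2 modulo 4 for lo <= n <= hi."""
    return {(n * n + 2) % 4 for n in range(lo, hi + 1)}

# Only the remainders 2 and 3 occur, so 4 never divides n*n + 2 in this range.
print(residues_mod4(-1000, 1000))
```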


19. The product of 5 consecutive integers is a multiple of 120.

Proof. Starting from 0, every fifth integer is a multiple of 5, every fourth integer
is a multiple of 4, every third integer is a multiple of 3, and every other integer
is a multiple of 2. It follows that any set of 5 consecutive integers must contain a
multiple of 5, a multiple of 4, at least one multiple of 3, and at least two multiples
of 2 (possibly one of which is a multiple of 4). It follows that the product of five
consecutive integers is a multiple of $5 \cdot 4 \cdot 3 \cdot 2 = 120$. ∎

For another approach, consider a product $n(n-1)(n-2)(n-3)(n-4)$ of five consecutive
integers (the largest of which is $n$). Now, we know that $\binom{n}{5}$ is an integer,
and $\binom{n}{5} = \frac{n!}{5!(n-5)!} = \frac{n(n-1)(n-2)(n-3)(n-4)}{5!} = \frac{n(n-1)(n-2)(n-3)(n-4)}{120}$, so 120 divides the product.
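
As an informal sanity check of this statement, the following Python sketch multiplies five consecutive integers and tests divisibility by 120 over a finite window. The function name `product_of_five` and the window are assumptions made for illustration; the check supplements the proof and does not replace it.

```python
from math import prod

def product_of_five(n):
    """Product n(n-1)(n-2)(n-3)(n-4) of five consecutive integers ending at n."""
    return prod(range(n - 4, n + 1))

# Every product in the window, including negative and zero cases, is a multiple of 120.
all_multiples = all(product_of_five(n) % 120 == 0 for n in range(-50, 51))
print(all_multiples)
```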


Hints for Exercises 20-23. For Exercise 20, first show that the equation $a^2 + b^2 =
3c^2$ has no solutions (other than the trivial solution $(a,b,c) = (0,0,0)$) in the
integers. To do this, investigate the remainders of a sum of squares (mod 4).
After you've done this, prove that the only solution is indeed the trivial solution.
Next assume that the equation $x^2 + y^2 - 3 = 0$ has a rational solution. Use the
definition of rational numbers to yield a contradiction.


Chapter 7 Exercises 


1. Suppose $x \in \mathbb{Z}$. Then $x$ is even if and only if $3x + 5$ is odd.

Proof. We first use direct proof to show that if $x$ is even, then $3x + 5$ is odd. If $x$
is even, then $x = 2n$ for some integer $n$, so $3x + 5 = 3(2n) + 5 = 6n + 5 = 6n + 4 + 1 =
2(3n + 2) + 1$. Thus $3x + 5$ is odd because it has form $2k + 1$, where $k = 3n + 2 \in \mathbb{Z}$.

Conversely, we need to show that if $3x + 5$ is odd, then $x$ is even. We will prove
this using contrapositive proof. Suppose $x$ is not even. Then $x$ is odd, so $x = 2n + 1$
for some integer $n$. Thus $3x + 5 = 3(2n+1) + 5 = 6n + 8 = 2(3n + 4)$. This means
$3x + 5$ is twice the integer $3n + 4$, so $3x + 5$ is even, not odd. ∎




3. Given an integer $a$, then $a^3 + a^2 + a$ is even if and only if $a$ is even.

Proof. First we will prove that if $a^3 + a^2 + a$ is even then $a$ is even. This is done
with contrapositive proof. Suppose $a$ is not even. Then $a$ is odd, so there is an
integer $n$ for which $a = 2n + 1$. Then

\begin{align*}
a^3 + a^2 + a &= (2n+1)^3 + (2n+1)^2 + (2n+1) \\
&= 8n^3 + 12n^2 + 6n + 1 + 4n^2 + 4n + 1 + 2n + 1 \\
&= 8n^3 + 16n^2 + 12n + 2 + 1 \\
&= 2(4n^3 + 8n^2 + 6n + 1) + 1.
\end{align*}

This expresses $a^3 + a^2 + a$ as twice an integer plus 1, so $a^3 + a^2 + a$ is odd, not even.
We have now shown that if $a^3 + a^2 + a$ is even then $a$ is even.

Conversely, we need to show that if $a$ is even, then $a^3 + a^2 + a$ is even. We will use
direct proof. Suppose $a$ is even, so $a = 2n$ for some integer $n$. Then $a^3 + a^2 + a =
(2n)^3 + (2n)^2 + 2n = 8n^3 + 4n^2 + 2n = 2(4n^3 + 2n^2 + n)$. Therefore, $a^3 + a^2 + a$ is even
because it's twice an integer. ∎


5. An integer $a$ is odd if and only if $a^3$ is odd.

Proof. Suppose that $a$ is odd. Then $a = 2n + 1$ for some integer $n$, and $a^3 =
(2n+1)^3 = 8n^3 + 12n^2 + 6n + 1 = 2(4n^3 + 6n^2 + 3n) + 1$. This shows that $a^3$ is twice an
integer, plus 1, so $a^3$ is odd. Thus we've proved that if $a$ is odd then $a^3$ is odd.

Conversely we need to show that if $a^3$ is odd, then $a$ is odd. For this we employ
contrapositive proof. Suppose $a$ is not odd. Thus $a$ is even, so $a = 2n$ for some
integer $n$. Then $a^3 = (2n)^3 = 8n^3 = 2(4n^3)$ is even (not odd). ∎


7. Suppose $x, y \in \mathbb{R}$. Then $(x+y)^2 = x^2 + y^2$ if and only if $x = 0$ or $y = 0$.

Proof. First we prove with direct proof that if $(x+y)^2 = x^2 + y^2$, then $x = 0$ or $y = 0$.
Suppose $(x+y)^2 = x^2 + y^2$. From this we get $x^2 + 2xy + y^2 = x^2 + y^2$, so $2xy = 0$, and
hence $xy = 0$. Thus $x = 0$ or $y = 0$.

Conversely, we need to show that if $x = 0$ or $y = 0$, then $(x+y)^2 = x^2 + y^2$. This will
be done with cases.

Case 1. If $x = 0$ then $(x+y)^2 = (0+y)^2 = y^2 = 0^2 + y^2 = x^2 + y^2$.

Case 2. If $y = 0$ then $(x+y)^2 = (x+0)^2 = x^2 = x^2 + 0^2 = x^2 + y^2$.

Either way, we have $(x+y)^2 = x^2 + y^2$. ∎


9. Suppose $a \in \mathbb{Z}$. Prove that $14 \mid a$ if and only if $7 \mid a$ and $2 \mid a$.

Proof. First we prove that if $14 \mid a$, then $7 \mid a$ and $2 \mid a$. Direct proof is used.
Suppose $14 \mid a$. This means $a = 14m$ for some integer $m$. Therefore $a = 7(2m)$,
which means $7 \mid a$, and also $a = 2(7m)$, which means $2 \mid a$. Thus $7 \mid a$ and $2 \mid a$.

Conversely, we need to prove that if $7 \mid a$ and $2 \mid a$, then $14 \mid a$. Once again direct
proof is used. Suppose $7 \mid a$ and $2 \mid a$. Since $2 \mid a$ it follows that $a = 2m$ for some




integer m, and that in turn implies that a is even. Since 7|a it follows that 
a=7n for some integer n. Now, since a is known to be even, and a = 7n, it follows 
that n is even (if it were odd, then a = 7n would be odd). Thus n = 2p for an 
appropriate integer p, and plugging n = 2p back into a = 7n gives a = 7(2p), so 
a = 14p. Therefore 14 | a. ∎


11. Suppose $a, b \in \mathbb{Z}$. Prove that $(a-3)b^2$ is even if and only if $a$ is odd or $b$ is even.

Proof. First we will prove that if $(a-3)b^2$ is even, then $a$ is odd or $b$ is even. For
this we use contrapositive proof. Suppose it is not the case that $a$ is odd or $b$ is
even. Then by DeMorgan's law, $a$ is even and $b$ is odd. Thus there are integers $m$
and $n$ for which $a = 2m$ and $b = 2n + 1$. Now observe $(a-3)b^2 = (2m-3)(2n+1)^2 =
(2m-3)(4n^2 + 4n + 1) = 8mn^2 + 8mn + 2m - 12n^2 - 12n - 3 = 8mn^2 + 8mn + 2m - 12n^2 -
12n - 4 + 1 = 2(4mn^2 + 4mn + m - 6n^2 - 6n - 2) + 1$. This shows $(a-3)b^2$ is odd, so it's
not even.

Conversely, we need to show that if $a$ is odd or $b$ is even, then $(a-3)b^2$ is even.
For this we use direct proof, with cases.

Case 1. Suppose $a$ is odd. Then $a = 2m + 1$ for some integer $m$. Thus $(a-3)b^2 =
(2m + 1 - 3)b^2 = (2m - 2)b^2 = 2(m-1)b^2$. Thus in this case $(a-3)b^2$ is even.

Case 2. Suppose $b$ is even. Then $b = 2n$ for some integer $n$. Thus $(a-3)b^2 =
(a-3)(2n)^2 = (a-3)4n^2 = 2(a-3)2n^2$. Thus in this case $(a-3)b^2$ is even.
Therefore, in any event, $(a-3)b^2$ is even. ∎


13. Suppose $a, b \in \mathbb{Z}$. If $a + b$ is odd, then $a^2 + b^2$ is odd.
Hint: Use direct proof. Suppose a+b is odd. Argue that this means a and b have
opposite parity. Then use cases.

15. Suppose $a, b \in \mathbb{Z}$. Prove that $a + b$ is even if and only if a and b have the same
parity.


Proof. First we will show that if a+b is even, then a and b have the same parity.
For this we use contrapositive proof. Suppose it is not the case that a and b have
the same parity. Then one of a and b is even and the other is odd. Without loss
of generality, let's say that a is even and b is odd. Thus there are integers m and
n for which a = 2m and b = 2n+1. Then a+b = 2m+2n+1 = 2(m+n)+1, so a+b
is odd, not even.

Conversely, we need to show that if a and b have the same parity, then a+b is
even. For this, we use direct proof with cases. Suppose a and b have the same
parity.

Case 1. Both a and b are even. Then there are integers m and n for which
a = 2m and b = 2n, so a+b = 2m+2n = 2(m+n) is clearly even.

Case 2. Both a and b are odd. Then there are integers m and n for which
a = 2m+1 and b = 2n+1, so a+b = 2m+1+2n+1 = 2(m+n+1) is clearly even.
Either way, a+b is even. This completes the proof. ∎




17. There is a prime number between 90 and 100.

Proof. Simply observe that 97 is prime. ∎

19. If $n \in \mathbb{N}$, then $2^0 + 2^1 + 2^2 + 2^3 + 2^4 + \cdots + 2^n = 2^{n+1} - 1$.


Proof. We use direct proof. Suppose $n \in \mathbb{N}$. Let $S$ be the number

\[ S = 2^0 + 2^1 + 2^2 + 2^3 + 2^4 + \cdots + 2^{n-1} + 2^n. \tag{1} \]

In what follows, we will solve for $S$ and show $S = 2^{n+1} - 1$. Multiplying both sides
of (1) by 2 gives

\[ 2S = 2^1 + 2^2 + 2^3 + 2^4 + 2^5 + \cdots + 2^n + 2^{n+1}. \tag{2} \]

Now subtract Equation (1) from Equation (2) to obtain $2S - S = -2^0 + 2^{n+1}$, which
simplifies to $S = 2^{n+1} - 1$. Combining this with Equation (1) produces $2^0 + 2^1 +
2^2 + 2^3 + 2^4 + \cdots + 2^n = 2^{n+1} - 1$, so the proof is complete. ∎
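
The identity for the sum of powers of 2 is easy to test on concrete values. Here is a minimal Python sketch (the name `sum_of_powers` is a hypothetical helper introduced only for this check) that compares the direct sum with the closed form:

```python
def sum_of_powers(n):
    """Compute 2^0 + 2^1 + ... + 2^n by direct summation."""
    return sum(2**i for i in range(n + 1))

# Compare the direct sum S with the closed form 2^(n+1) - 1 for several n.
for n in (1, 2, 3, 10):
    print(n, sum_of_powers(n), 2**(n + 1) - 1)
```

Python integers have arbitrary precision, so the comparison stays exact even for large n.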


21. Every real solution of $x^3 + x + 3 = 0$ is irrational.

Proof. Suppose for the sake of contradiction that this polynomial has a rational
solution $\frac{a}{b}$. We may assume that this fraction is fully reduced, so $a$ and $b$ are not
both even. We have $\left(\frac{a}{b}\right)^3 + \frac{a}{b} + 3 = 0$. Clearing the denominator gives
$a^3 + ab^2 + 3b^3 = 0$.

Consider two cases: First, if both $a$ and $b$ are odd, the left-hand side is a sum
of three odds, which is odd, meaning 0 is odd, a contradiction. Second, if one of
$a$ and $b$ is odd and the other is even, then the middle term of $a^3 + ab^2 + 3b^3$ is
even, while $a^3$ and $3b^3$ have opposite parity. Then $a^3 + ab^2 + 3b^3$ is the sum of two
evens and an odd, which is odd, again contradicting the fact that 0 is even. ∎


23. Suppose $a$, $b$ and $c$ are integers. If $a \mid b$ and $a \mid (b^2 - c)$, then $a \mid c$.

Proof. (Direct) Suppose $a \mid b$ and $a \mid (b^2 - c)$. This means that $b = ad$ and $b^2 - c = ae$
for some integers $d$ and $e$. Squaring the first equation produces $b^2 = a^2d^2$.
Subtracting $b^2 - c = ae$ from $b^2 = a^2d^2$ gives $c = a^2d^2 - ae = a(ad^2 - e)$. As $ad^2 - e \in \mathbb{Z}$,
it follows that $a \mid c$. ∎


25. If $p > 1$ is an integer and $n \nmid p$ for each integer $n$ for which $2 \le n \le \sqrt{p}$, then $p$ is
prime.

Proof. (Contrapositive) Suppose that $p$ is not prime, so it factors as $p = mn$ for
$1 < m, n < p$.

Observe that it is not the case that both $m > \sqrt{p}$ and $n > \sqrt{p}$, because if this were
true the inequalities would multiply to give $mn > \sqrt{p}\sqrt{p} = p$, which contradicts
$p = mn$.

Therefore $m \le \sqrt{p}$ or $n \le \sqrt{p}$. Without loss of generality, say $n \le \sqrt{p}$. Then the
equation $p = mn$ gives $n \mid p$, with $1 < n \le \sqrt{p}$. Therefore it is not true that $n \nmid p$ for
each integer $n$ for which $2 \le n \le \sqrt{p}$. ∎
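
This result is what justifies stopping trial division at the square root of p. A minimal Python sketch under that reading follows; the function name `is_prime` is an illustrative choice, and `math.isqrt` is used so that the bound is computed in exact integer arithmetic.

```python
from math import isqrt

def is_prime(p):
    """Trial division: p > 1 is prime if no n with 2 <= n <= sqrt(p) divides it."""
    if p <= 1:
        return False
    # isqrt(p) is the largest integer n with n*n <= p.
    return all(p % n != 0 for n in range(2, isqrt(p) + 1))

# The only prime between 90 and 100 is 97 (compare Exercise 17 above).
print([p for p in range(90, 101) if is_prime(p)])  # [97]
```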




27. Suppose $a, b \in \mathbb{Z}$. If $a^2 + b^2$ is a perfect square, then $a$ and $b$ are not both odd.

Proof. (Contradiction) Suppose $a^2 + b^2$ is a perfect square, and $a$ and $b$ are both
odd. As $a^2 + b^2$ is a perfect square, say $c$ is the integer for which $c^2 = a^2 + b^2$. As
$a$ and $b$ are odd, we have $a = 2m + 1$ and $b = 2n + 1$ for integers $m$ and $n$. Then

\[ c^2 = a^2 + b^2 = (2m+1)^2 + (2n+1)^2 = 4(m^2 + n^2 + m + n) + 2. \]

This is even, so $c$ is even also; let $c = 2k$. Now the above equation results in
$(2k)^2 = 4(m^2 + n^2 + m + n) + 2$, which simplifies to $2k^2 = 2(m^2 + n^2 + m + n) + 1$. Thus
$2k^2$ is both even and odd, a contradiction. ∎


29. If $a \mid bc$ and $\gcd(a,b) = 1$, then $a \mid c$.

Proof. (Direct) Suppose $a \mid bc$ and $\gcd(a,b) = 1$. The fact that $a \mid bc$ means $bc = az$
for some integer $z$. The fact that $\gcd(a,b) = 1$ means that $ax + by = 1$ for some
integers $x$ and $y$ (by Proposition 7.1 on page 152). From this we get $acx + bcy = c$;
substituting $bc = az$ yields $acx + azy = c$, that is, $a(cx + zy) = c$. Therefore $a \mid c$. ∎


31. If $n \in \mathbb{Z}$, then $\gcd(n, n+1) = 1$.

Proof. Suppose $d$ is a positive integer that is a common divisor of $n$ and $n + 1$.
Then $n = dx$ and $n + 1 = dy$ for integers $x$ and $y$. Then $1 = (n+1) - n = dy - dx =
d(y - x)$. Now, $1 = d(y - x)$ is only possible if $d = \pm 1$ and $y - x = \pm 1$. Thus the
greatest common divisor of $n$ and $n + 1$ can be no greater than 1. But 1 does
divide both $n$ and $n + 1$, so $\gcd(n, n+1) = 1$. ∎


33. If $n \in \mathbb{Z}$, then $\gcd(2n+1, 4n^2+1) = 1$.

Proof. Note that $4n^2 + 1 = (2n+1)(2n-1) + 2$. Therefore, it suffices to show that
$\gcd(2n+1, (2n+1)(2n-1)+2) = 1$. Let $d$ be a common positive divisor of both $2n+1$
and $(2n+1)(2n-1)+2$, so $2n + 1 = dx$ and $(2n+1)(2n-1) + 2 = dy$ for integers $x$
and $y$. Substituting the first equation into the second gives $dx(2n-1) + 2 = dy$,
so $2 = dy - dx(2n-1) = d(y - 2nx + x)$. This means $d$ divides 2, so $d$ equals 1 or
2. But the equation $2n + 1 = dx$ means $d$ must be odd. Therefore $d = 1$, that is,
$\gcd(2n+1, (2n+1)(2n-1)+2) = 1$. ∎
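
Both gcd statements above can be spot-checked with the standard library. The sketch below uses `math.gcd` on a finite range of integers; the function name `gcd_claims_hold` and the range are assumptions made for illustration only.

```python
from math import gcd

def gcd_claims_hold(limit):
    """Check gcd(n, n+1) = 1 and gcd(2n+1, 4n^2+1) = 1 for -limit <= n <= limit."""
    return all(
        gcd(n, n + 1) == 1 and gcd(2 * n + 1, 4 * n * n + 1) == 1
        for n in range(-limit, limit + 1)
    )

print(gcd_claims_hold(1000))
```

Note that `math.gcd` returns a nonnegative result even for negative arguments, which matches the convention that the greatest common divisor is positive.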


35. Suppose $a, b \in \mathbb{N}$. Then $a = \gcd(a,b)$ if and only if $a \mid b$.

Proof. Suppose $a = \gcd(a,b)$. This means $a$ is a divisor of both $a$ and $b$. In
particular $a \mid b$.

Conversely, suppose $a \mid b$. Then $a$ divides both $a$ and $b$, so $a \le \gcd(a,b)$. On the
other hand, since $\gcd(a,b)$ divides $a$, we have $a = \gcd(a,b) \cdot x$ for some integer $x$.
As all integers involved are positive, it follows that $a \ge \gcd(a,b)$.

It has been established that $a \le \gcd(a,b)$ and $a \ge \gcd(a,b)$. Thus $a = \gcd(a,b)$. ∎




Chapter 8 Exercises 


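1. Prove that $\{12n : n \in \mathbb{Z}\} \subseteq \{2n : n \in \mathbb{Z}\} \cap \{3n : n \in \mathbb{Z}\}$.

Proof. Suppose $a \in \{12n : n \in \mathbb{Z}\}$. This means $a = 12n$ for some $n \in \mathbb{Z}$. Therefore
$a = 2(6n)$ and $a = 3(4n)$. From $a = 2(6n)$, it follows that $a$ is a multiple of 2, so
$a \in \{2n : n \in \mathbb{Z}\}$. From $a = 3(4n)$, it follows that $a$ is a multiple of 3, so $a \in \{3n : n \in \mathbb{Z}\}$. Thus
by definition of the intersection of two sets, we have $a \in \{2n : n \in \mathbb{Z}\} \cap \{3n : n \in \mathbb{Z}\}$.
Thus $\{12n : n \in \mathbb{Z}\} \subseteq \{2n : n \in \mathbb{Z}\} \cap \{3n : n \in \mathbb{Z}\}$. ∎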


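3. If $k \in \mathbb{Z}$, then $\{n \in \mathbb{Z} : n \mid k\} \subseteq \{n \in \mathbb{Z} : n \mid k^2\}$.

Proof. Suppose $k \in \mathbb{Z}$. We now need to show $\{n \in \mathbb{Z} : n \mid k\} \subseteq \{n \in \mathbb{Z} : n \mid k^2\}$.

Suppose $a \in \{n \in \mathbb{Z} : n \mid k\}$. Then it follows that $a \mid k$, so there is an integer $c$ for
which $k = ac$. Then $k^2 = a^2c^2$. Therefore $k^2 = a(ac^2)$, and from this the definition
of divisibility gives $a \mid k^2$. But $a \mid k^2$ means that $a \in \{n \in \mathbb{Z} : n \mid k^2\}$. We have now
shown $\{n \in \mathbb{Z} : n \mid k\} \subseteq \{n \in \mathbb{Z} : n \mid k^2\}$. ∎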


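5. If $p$ and $q$ are integers, then $\{pn : n \in \mathbb{N}\} \cap \{qn : n \in \mathbb{N}\} \neq \varnothing$.

Proof. Suppose $p$ and $q$ are integers. Consider the integer $pq$. Observe that
$pq \in \{pn : n \in \mathbb{N}\}$ and $pq \in \{qn : n \in \mathbb{N}\}$, so $pq \in \{pn : n \in \mathbb{N}\} \cap \{qn : n \in \mathbb{N}\}$. Therefore
$\{pn : n \in \mathbb{N}\} \cap \{qn : n \in \mathbb{N}\} \neq \varnothing$. ∎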


7. Suppose $A$, $B$ and $C$ are sets. If $B \subseteq C$, then $A \times B \subseteq A \times C$.

Proof. This is a conditional statement, and we'll prove it with direct proof.
Suppose $B \subseteq C$. (Now we need to prove $A \times B \subseteq A \times C$.)

Suppose $(a,b) \in A \times B$. Then by definition of the Cartesian product we have $a \in A$
and $b \in B$. But since $b \in B$ and $B \subseteq C$, we have $b \in C$. Since $a \in A$ and $b \in C$, it
follows that $(a,b) \in A \times C$. Now we've shown $(a,b) \in A \times B$ implies $(a,b) \in A \times C$, so
$A \times B \subseteq A \times C$.

In summary, we've shown that if $B \subseteq C$, then $A \times B \subseteq A \times C$. This completes the
proof. ∎


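9. If $A$, $B$ and $C$ are sets then $A \cap (B \cup C) = (A \cap B) \cup (A \cap C)$.

Proof. We use the distributive law $P \wedge (Q \vee R) = (P \wedge Q) \vee (P \wedge R)$ from page 52.

\begin{align*}
A \cap (B \cup C) &= \{x : x \in A \wedge x \in B \cup C\} && \text{(def. of intersection)} \\
&= \{x : x \in A \wedge (x \in B \vee x \in C)\} && \text{(def. of union)} \\
&= \{x : (x \in A \wedge x \in B) \vee (x \in A \wedge x \in C)\} && \text{(distributive law)} \\
&= \{x : (x \in A \cap B) \vee (x \in A \cap C)\} && \text{(def. of intersection)} \\
&= (A \cap B) \cup (A \cap C) && \text{(def. of union)}
\end{align*}

The proof is complete. ∎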




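11. If $A$ and $B$ are sets in a universal set $U$, then $\overline{A \cup B} = \overline{A} \cap \overline{B}$.

Proof. Just observe the following sequence of equalities.

\begin{align*}
\overline{A \cup B} &= U - (A \cup B) && \text{(def. of complement)} \\
&= \{x : (x \in U) \wedge (x \notin A \cup B)\} && \text{(def. of $-$)} \\
&= \{x : (x \in U) \wedge \sim(x \in A \cup B)\} \\
&= \{x : (x \in U) \wedge \sim((x \in A) \vee (x \in B))\} && \text{(def. of $\cup$)} \\
&= \{x : (x \in U) \wedge (\sim(x \in A) \wedge \sim(x \in B))\} && \text{(DeMorgan)} \\
&= \{x : (x \in U) \wedge (x \notin A) \wedge (x \notin B)\} \\
&= \{x : (x \in U) \wedge (x \in U) \wedge (x \notin A) \wedge (x \notin B)\} && (x \in U) = (x \in U) \wedge (x \in U) \\
&= \{x : ((x \in U) \wedge (x \notin A)) \wedge ((x \in U) \wedge (x \notin B))\} && \text{(regroup)} \\
&= \{x : (x \in U) \wedge (x \notin A)\} \cap \{x : (x \in U) \wedge (x \notin B)\} && \text{(def. of $\cap$)} \\
&= (U - A) \cap (U - B) && \text{(def. of $-$)} \\
&= \overline{A} \cap \overline{B} && \text{(def. of complement)}
\end{align*}

The proof is complete. ∎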


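13. If $A$, $B$ and $C$ are sets, then $A - (B \cup C) = (A - B) \cap (A - C)$.

Proof. Just observe the following sequence of equalities.

\begin{align*}
A - (B \cup C) &= \{x : (x \in A) \wedge (x \notin B \cup C)\} && \text{(def. of $-$)} \\
&= \{x : (x \in A) \wedge \sim(x \in B \cup C)\} \\
&= \{x : (x \in A) \wedge \sim((x \in B) \vee (x \in C))\} && \text{(def. of $\cup$)} \\
&= \{x : (x \in A) \wedge (\sim(x \in B) \wedge \sim(x \in C))\} && \text{(DeMorgan)} \\
&= \{x : (x \in A) \wedge (x \notin B) \wedge (x \notin C)\} \\
&= \{x : (x \in A) \wedge (x \in A) \wedge (x \notin B) \wedge (x \notin C)\} && (x \in A) = (x \in A) \wedge (x \in A) \\
&= \{x : ((x \in A) \wedge (x \notin B)) \wedge ((x \in A) \wedge (x \notin C))\} && \text{(regroup)} \\
&= \{x : (x \in A) \wedge (x \notin B)\} \cap \{x : (x \in A) \wedge (x \notin C)\} && \text{(def. of $\cap$)} \\
&= (A - B) \cap (A - C) && \text{(def. of $-$)}
\end{align*}

The proof is complete. ∎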


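15. If $A$, $B$ and $C$ are sets, then $(A \cap B) - C = (A - C) \cap (B - C)$.

Proof. Just observe the following sequence of equalities.

\begin{align*}
(A \cap B) - C &= \{x : (x \in A \cap B) \wedge (x \notin C)\} && \text{(def. of $-$)} \\
&= \{x : (x \in A) \wedge (x \in B) \wedge (x \notin C)\} && \text{(def. of $\cap$)} \\
&= \{x : (x \in A) \wedge (x \notin C) \wedge (x \in B) \wedge (x \notin C)\} && \text{(regroup)} \\
&= \{x : ((x \in A) \wedge (x \notin C)) \wedge ((x \in B) \wedge (x \notin C))\} && \text{(regroup)} \\
&= \{x : (x \in A) \wedge (x \notin C)\} \cap \{x : (x \in B) \wedge (x \notin C)\} && \text{(def. of $\cap$)} \\
&= (A - C) \cap (B - C) && \text{(def. of $-$)}
\end{align*}

The proof is complete. ∎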


17. If $A$, $B$ and $C$ are sets, then $A \times (B \cap C) = (A \times B) \cap (A \times C)$.

Proof. See Example 8.12. ∎




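19. Prove that $\{9^n : n \in \mathbb{Z}\} \subseteq \{3^n : n \in \mathbb{Z}\}$, but $\{9^n : n \in \mathbb{Z}\} \neq \{3^n : n \in \mathbb{Z}\}$.

Proof. Suppose $a \in \{9^n : n \in \mathbb{Z}\}$. This means $a = 9^n$ for some integer $n \in \mathbb{Z}$. Thus
$a = 9^n = (3^2)^n = 3^{2n}$. This shows $a$ is an integer power of 3, so $a \in \{3^n : n \in \mathbb{Z}\}$.
Therefore $a \in \{9^n : n \in \mathbb{Z}\}$ implies $a \in \{3^n : n \in \mathbb{Z}\}$, so $\{9^n : n \in \mathbb{Z}\} \subseteq \{3^n : n \in \mathbb{Z}\}$.

But notice $\{9^n : n \in \mathbb{Z}\} \neq \{3^n : n \in \mathbb{Z}\}$ as $3 \in \{3^n : n \in \mathbb{Z}\}$, but $3 \notin \{9^n : n \in \mathbb{Z}\}$. ∎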


21. Suppose $A$ and $B$ are sets. Prove $A \subseteq B$ if and only if $A - B = \varnothing$.

Proof. First we will prove that if $A \subseteq B$, then $A - B = \varnothing$. Contrapositive proof is
used. Suppose that $A - B \neq \varnothing$. Thus there is an element $a \in A - B$, which means
$a \in A$ but $a \notin B$. Since not every element of $A$ is in $B$, we have $A \not\subseteq B$.

Conversely, we will prove that if $A - B = \varnothing$, then $A \subseteq B$. Again, contrapositive proof
is used. Suppose $A \not\subseteq B$. This means that it is not the case that every element of
$A$ is an element of $B$, so there is an element $a \in A$ with $a \notin B$. Therefore we have
$a \in A - B$, so $A - B \neq \varnothing$. ∎


23. For each $a \in \mathbb{R}$, let $A_a = \{(x, a(x^2 - 1)) \in \mathbb{R}^2 : x \in \mathbb{R}\}$. Prove that $\bigcap_{a \in \mathbb{R}} A_a = \{(-1,0), (1,0)\}$.

Proof. First we will show that $\{(-1,0), (1,0)\} \subseteq \bigcap_{a \in \mathbb{R}} A_a$. Notice that for any $a \in \mathbb{R}$, we
have $(-1,0) \in A_a$ because $A_a$ contains the ordered pair $(-1, a((-1)^2 - 1)) = (-1,0)$.
Similarly $(1,0) \in A_a$. Thus each element of $\{(-1,0), (1,0)\}$ belongs to every set $A_a$,
so $\{(-1,0), (1,0)\} \subseteq \bigcap_{a \in \mathbb{R}} A_a$.

Now we will show $\bigcap_{a \in \mathbb{R}} A_a \subseteq \{(-1,0), (1,0)\}$. Suppose $(c,d) \in \bigcap_{a \in \mathbb{R}} A_a$. This means $(c,d)$
is in every set $A_a$. In particular $(c,d) \in A_0 = \{(x, 0(x^2 - 1)) : x \in \mathbb{R}\} = \{(x,0) : x \in \mathbb{R}\}$. It
follows that $d = 0$. Then also we have $(c,d) = (c,0) \in A_1 = \{(x, 1(x^2 - 1)) : x \in \mathbb{R}\} =
\{(x, x^2 - 1) : x \in \mathbb{R}\}$. Therefore $(c,0)$ has the form $(c, c^2 - 1)$, that is $(c,0) = (c, c^2 - 1)$.
From this we get $c^2 - 1 = 0$, so $c = \pm 1$. Therefore $(c,d) = (1,0)$ or $(c,d) = (-1,0)$, so
$(c,d) \in \{(-1,0), (1,0)\}$. This completes the demonstration that $(c,d) \in \bigcap_{a \in \mathbb{R}} A_a$ implies
$(c,d) \in \{(-1,0), (1,0)\}$, so it follows that $\bigcap_{a \in \mathbb{R}} A_a \subseteq \{(-1,0), (1,0)\}$.

Now it's been shown that $\{(-1,0), (1,0)\} \subseteq \bigcap_{a \in \mathbb{R}} A_a$ and $\bigcap_{a \in \mathbb{R}} A_a \subseteq \{(-1,0), (1,0)\}$, so it
follows that $\bigcap_{a \in \mathbb{R}} A_a = \{(-1,0), (1,0)\}$. ∎


25. Suppose $A$, $B$, $C$ and $D$ are sets. Prove that $(A \times B) \cup (C \times D) \subseteq (A \cup C) \times (B \cup D)$.

Proof. Suppose $(a,b) \in (A \times B) \cup (C \times D)$.
By definition of union, this means $(a,b) \in (A \times B)$ or $(a,b) \in (C \times D)$.
We examine these two cases individually.




Case 1. Suppose $(a,b) \in (A \times B)$. By definition of $\times$, it follows that $a \in A$ and $b \in B$.
From this, it follows from the definition of $\cup$ that $a \in A \cup C$ and $b \in B \cup D$.
Again from the definition of $\times$, we get $(a,b) \in (A \cup C) \times (B \cup D)$.

Case 2. Suppose $(a,b) \in (C \times D)$. By definition of $\times$, it follows that $a \in C$ and $b \in D$.
From this, it follows from the definition of $\cup$ that $a \in A \cup C$ and $b \in B \cup D$.
Again from the definition of $\times$, we get $(a,b) \in (A \cup C) \times (B \cup D)$.

In either case, we obtained $(a,b) \in (A \cup C) \times (B \cup D)$,
so we've proved that $(a,b) \in (A \times B) \cup (C \times D)$ implies $(a,b) \in (A \cup C) \times (B \cup D)$.
Therefore $(A \times B) \cup (C \times D) \subseteq (A \cup C) \times (B \cup D)$. ∎


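27. Prove $\{12a + 4b : a, b \in \mathbb{Z}\} = \{4c : c \in \mathbb{Z}\}$.

Proof. First we show $\{12a + 4b : a, b \in \mathbb{Z}\} \subseteq \{4c : c \in \mathbb{Z}\}$. Suppose $x \in \{12a + 4b : a, b \in \mathbb{Z}\}$.
Then $x = 12a + 4b$ for some integers $a$ and $b$. From this we get $x = 4(3a + b)$, so
$x = 4c$ where $c$ is the integer $3a + b$. Consequently $x \in \{4c : c \in \mathbb{Z}\}$. This establishes
that $\{12a + 4b : a, b \in \mathbb{Z}\} \subseteq \{4c : c \in \mathbb{Z}\}$.

Next we show $\{4c : c \in \mathbb{Z}\} \subseteq \{12a + 4b : a, b \in \mathbb{Z}\}$. Suppose $x \in \{4c : c \in \mathbb{Z}\}$. Then $x = 4c$
for some $c \in \mathbb{Z}$. Thus $x = (12 + 4(-2))c = 12c + 4(-2c)$, and since $c$ and $-2c$ are
integers we have $x \in \{12a + 4b : a, b \in \mathbb{Z}\}$.

This proves that $\{12a + 4b : a, b \in \mathbb{Z}\} = \{4c : c \in \mathbb{Z}\}$. ∎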


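29. Suppose $A \neq \varnothing$. Prove that $A \times B \subseteq A \times C$, if and only if $B \subseteq C$.

Proof. First we will prove that if $A \times B \subseteq A \times C$, then $B \subseteq C$. Using contrapositive,
suppose that $B \not\subseteq C$. This means there is an element $b \in B$ with $b \notin C$. Since $A \neq \varnothing$,
there exists an element $a \in A$. Now consider the ordered pair $(a,b)$. Note that
$(a,b) \in A \times B$, but $(a,b) \notin A \times C$. This means $A \times B \not\subseteq A \times C$.

Conversely, we will now show that if $B \subseteq C$, then $A \times B \subseteq A \times C$. We use direct
proof. Suppose $B \subseteq C$. Assume that $(a,b) \in A \times B$. This means $a \in A$ and $b \in B$.
But, as $B \subseteq C$, we also have $b \in C$. From $a \in A$ and $b \in C$, we get $(a,b) \in A \times C$.
We've now shown $(a,b) \in A \times B$ implies $(a,b) \in A \times C$, so $A \times B \subseteq A \times C$. ∎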


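31. Suppose $B \neq \varnothing$ and $A \times B \subseteq B \times C$. Prove $A \subseteq C$.

Proof. Suppose $B \neq \varnothing$ and $A \times B \subseteq B \times C$. In what follows, we show that $A \subseteq C$.

Let $x \in A$. Because $B$ is not empty, it contains some element $b$. Observe that
$(x,b) \in A \times B$. But as $A \times B \subseteq B \times C$, we also have $(x,b) \in B \times C$, so, in particular,
$x \in B$. As $x \in A$ and $x \in B$, we have $(x,x) \in A \times B$. But as $A \times B \subseteq B \times C$, it follows that
$(x,x) \in B \times C$. This implies $x \in C$. We've shown $x \in A$ implies $x \in C$, so $A \subseteq C$. ∎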




Chapter 9 Exercises 


1. If $x, y \in \mathbb{R}$, then $|x + y| = |x| + |y|$.

This is false.

Disproof: Here is a counterexample: Let $x = 1$ and $y = -1$. Then $|x + y| = 0$ and
$|x| + |y| = 2$, so it's not true that $|x + y| = |x| + |y|$.


3. If $n \in \mathbb{Z}$ and $n^5 - n$ is even, then $n$ is even.

This is false.
Disproof: Here is a counterexample: Let $n = 3$. Then $n^5 - n = 3^5 - 3 = 240$, but $n$ is
not even.


5. If $A$, $B$, $C$ and $D$ are sets, then $(A \times B) \cup (C \times D) = (A \cup C) \times (B \cup D)$.

This is false.

Disproof: Here is a counterexample: Let $A = \{1,2\}$, $B = \{1,2\}$, $C = \{2,3\}$ and
$D = \{2,3\}$. Then $(A \times B) \cup (C \times D) = \{(1,1), (1,2), (2,1), (2,2)\} \cup \{(2,2), (2,3), (3,2), (3,3)\} =
\{(1,1), (1,2), (2,1), (2,2), (2,3), (3,2), (3,3)\}$. Also $(A \cup C) \times (B \cup D) = \{1,2,3\} \times \{1,2,3\} =
\{(1,1), (1,2), (1,3), (2,1), (2,2), (2,3), (3,1), (3,2), (3,3)\}$, so you can see that
$(A \times B) \cup (C \times D) \neq (A \cup C) \times (B \cup D)$.
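
The counterexample can be reproduced mechanically. The Python sketch below builds both sides with `itertools.product` and lists the ordered pairs that appear only on the right-hand side; the variable names `left` and `right` are illustrative.

```python
from itertools import product

A, B, C, D = {1, 2}, {1, 2}, {2, 3}, {2, 3}

# Left side: (A x B) U (C x D). Right side: (A U C) x (B U D).
left = set(product(A, B)) | set(product(C, D))
right = set(product(A | C, B | D))

print(len(left), len(right))   # 7 9
print(sorted(right - left))    # [(1, 3), (3, 1)]
```

The two missing pairs show that the inclusion of Exercise 25 in Chapter 8 can be proper.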


7. If $A$, $B$ and $C$ are sets, and $A \times C = B \times C$, then $A = B$.

This is false.
Disproof: Here is a counterexample: Let $A = \{1\}$, $B = \{2\}$ and $C = \varnothing$. Then $A \times C =
B \times C = \varnothing$, but $A \neq B$.


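9. If $A$ and $B$ are sets, then $\mathscr{P}(A) - \mathscr{P}(B) \subseteq \mathscr{P}(A - B)$.

This is false.

Disproof: Here is a counterexample: Let $A = \{1,2\}$ and $B = \{1\}$. Then $\mathscr{P}(A) -
\mathscr{P}(B) = \{\varnothing, \{1\}, \{2\}, \{1,2\}\} - \{\varnothing, \{1\}\} = \{\{2\}, \{1,2\}\}$. Also $\mathscr{P}(A - B) = \mathscr{P}(\{2\}) = \{\varnothing, \{2\}\}$. In
this example we have $\mathscr{P}(A) - \mathscr{P}(B) \not\subseteq \mathscr{P}(A - B)$.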
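
To see this counterexample concretely, one can compute the power sets in Python. In this sketch `power_set` is a hypothetical helper, not a library function, and subsets are stored as `frozenset` objects so that they can themselves be elements of a set.

```python
from itertools import combinations

def power_set(s):
    """Return the set of all subsets of s, each represented as a frozenset."""
    items = list(s)
    return {
        frozenset(c)
        for r in range(len(items) + 1)
        for c in combinations(items, r)
    }

A, B = {1, 2}, {1}
difference = power_set(A) - power_set(B)   # the subsets {2} and {1, 2}
target = power_set(A - B)                  # the subsets {} and {2}

print(difference <= target)  # False
```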


11. If $a, b \in \mathbb{N}$, then $a + b < ab$.

This is false.

Disproof: Here is a counterexample: Let $a = 1$ and $b = 1$. Then $a + b = 2$ and $ab = 1$,
so it's not true that $a + b < ab$.


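13. There exists a set $X$ for which $\mathbb{R} \subseteq X$ and $\varnothing \in X$.

This is true.

Proof. Simply let $X = \mathbb{R} \cup \{\varnothing\}$. If $x \in \mathbb{R}$, then $x \in \mathbb{R} \cup \{\varnothing\} = X$, so $\mathbb{R} \subseteq X$. Likewise,
$\varnothing \in \mathbb{R} \cup \{\varnothing\} = X$ because $\varnothing \in \{\varnothing\}$. ∎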


15. Every odd integer is the sum of three odd integers.

This is true.

Proof. If $n$ is odd, then $n = n + 1 + (-1)$. Thus $n$ is the sum of three odd integers. ∎

17. For all sets $A$ and $B$, if $A - B = \varnothing$, then $B \neq \varnothing$.

This is false.

Disproof: Here is a counterexample: Just let $A = \varnothing$ and $B = \varnothing$. Then $A - B = \varnothing$,
but it's not true that $B \neq \varnothing$.




19. For every $r, s \in \mathbb{Q}$ with $r < s$, there is an irrational number $u$ for which $r < u < s$.

This is true.

Proof. (Direct) Suppose $r, s \in \mathbb{Q}$ with $r < s$. Consider the number $u = r + \sqrt{2}\,\frac{s-r}{2}$. In
what follows we will show that $u$ is irrational and $r < u < s$. Certainly since $s - r$
is positive, it follows that $r < r + \sqrt{2}\,\frac{s-r}{2} = u$. Also, since $\sqrt{2} < 2$ we have

\[ u = r + \sqrt{2}\,\frac{s-r}{2} < r + 2\,\frac{s-r}{2} = s, \]

and therefore $u < s$. Thus we can conclude $r < u < s$.
Now we just need to show that $u$ is irrational. Suppose for the sake of contradiction
that $u$ is rational. Then $u = \frac{a}{b}$ for some integers $a$ and $b$. Since $r$ and $s$ are
rational, we have $r = \frac{c}{d}$ and $s = \frac{e}{f}$ for some $c, d, e, f \in \mathbb{Z}$. Now we have

\begin{align*}
u &= r + \sqrt{2}\,\frac{s-r}{2} \\
\frac{a}{b} &= \frac{c}{d} + \sqrt{2}\,\frac{\frac{e}{f} - \frac{c}{d}}{2} \\
\frac{ad - bc}{bd} &= \sqrt{2}\,\frac{ed - cf}{2df} \\
\frac{(ad - bc)2df}{bd(ed - cf)} &= \sqrt{2}.
\end{align*}

This expresses $\sqrt{2}$ as a quotient of two integers, so $\sqrt{2}$ is rational, a contradiction.
Thus $u$ is irrational.

In summary, we have produced an irrational number $u$ with $r < u < s$, so the
proof is complete. ∎


21. There exist two prime numbers $p$ and $q$ for which $p - q = 97$.

This statement is false.

Disproof: Suppose for the sake of contradiction that this is true. Let $p$ and $q$ be
prime numbers for which $p - q = 97$. Now, since their difference is odd, $p$ and $q$
must have opposite parity, so one of $p$ and $q$ is even and the other is odd. But
there exists only one even prime number (namely 2), so either $p = 2$ or $q = 2$. If
$p = 2$, then $p - q = 97$ implies $q = 2 - 97 = -95$, which is not prime. On the other
hand if $q = 2$, then $p - q = 97$ implies $p = 99$, but that's not prime either. Thus one
of $p$ or $q$ is not prime, a contradiction.


23. If $x, y \in \mathbb{R}$ and $x^3 < y^3$, then $x < y$. This is true.

Proof. (Contrapositive) Suppose $x \ge y$. We need to show $x^3 \ge y^3$.

Case 1. Suppose $x$ and $y$ have opposite signs, that is one of $x$ and $y$ is positive
and the other is negative. Then since $x \ge y$, $x$ is positive and $y$ is negative. Then,
since the powers are odd, $x^3$ is positive and $y^3$ is negative, so $x^3 \ge y^3$.

Case 2. Suppose $x$ and $y$ do not have opposite signs. Then $x^2 + xy + y^2 \ge 0$ and




also $x - y \ge 0$ because $x \ge y$. Thus we have $x^3 - y^3 = (x - y)(x^2 + xy + y^2) \ge 0$. From
this we get $x^3 - y^3 \ge 0$, so $x^3 \ge y^3$.
In either case we have $x^3 \ge y^3$. ∎


25. For all $a, b, c \in \mathbb{Z}$, if $a \mid bc$, then $a \mid b$ or $a \mid c$.
This is false.
Disproof: Let $a = 6$, $b = 3$ and $c = 4$. Note that $a \mid bc$, but $a \nmid b$ and $a \nmid c$.


27. The equation $x^2 = 2^x$ has three real solutions.

Proof. By inspection, the numbers $x = 2$ and $x = 4$ are two solutions of this
equation. But there is a third solution. Let $m$ be a positive real number for
which $m2^m = \frac{1}{2}$. (The existence of such an $m$ is guaranteed by the intermediate
value theorem of calculus.) Then the negative number $x = -2m$ is a solution, as

\[ x^2 = (-2m)^2 = 4m^2 = 4\left(\frac{m2^m}{2^m}\right)^2 = 4\left(\frac{\frac{1}{2}}{2^m}\right)^2 = \frac{1}{2^{2m}} = 2^{-2m} = 2^x. \]

Therefore we have three solutions 2, 4 and $-2m$. ∎
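
The third solution can be approximated numerically. The following Python sketch finds m with m·2^m = 1/2 by bisection on the interval [0, 1] (where the function changes sign) and then checks that x = -2m satisfies x² = 2^x up to rounding error. The names `solve_m` and `g`, the interval, and the tolerance are assumptions made for illustration.

```python
def solve_m(tol=1e-12):
    """Approximate the positive m with m * 2**m = 1/2 by bisection."""
    def g(m):
        return m * 2**m - 0.5

    lo, hi = 0.0, 1.0  # g(0) = -0.5 < 0 and g(1) = 1.5 > 0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

m = solve_m()
x = -2 * m
# The two printed values x**2 and 2**x should agree to many decimal places.
print(x, x**2, 2**x)
```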


29. If $x, y \in \mathbb{R}$ and $|x + y| = |x - y|$, then $y = 0$.
This is false. Disproof: Let $x = 0$ and $y = 1$. Then $|x + y| = |x - y|$, but $y = 1$.


31. No number appears in Pascal's triangle more than four times.

This is false. Disproof: The number 120 appears six times. Check that $\binom{10}{3} =
\binom{10}{7} = \binom{16}{2} = \binom{16}{14} = \binom{120}{1} = \binom{120}{119} = 120$.
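
These six binomial coefficients can be confirmed with `math.comb` from the Python standard library (available in Python 3.8 and later). The sketch also counts the occurrences of 120 in rows 0 through 120 of Pascal's triangle; the variable names are illustrative.

```python
from math import comb

pairs = [(10, 3), (10, 7), (16, 2), (16, 14), (120, 1), (120, 119)]
values = [comb(n, k) for n, k in pairs]
print(values)

# Count every entry equal to 120 in rows 0..120. Rows beyond 120 cannot
# contain 120, because their interior entries are all larger than 120.
occurrences = sum(
    1 for n in range(121) for k in range(n + 1) if comb(n, k) == 120
)
print(occurrences)
```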

33. Suppose $f(x) = a_0 + a_1x + a_2x^2 + \cdots + a_nx^n$ is a polynomial of degree 1 or greater,
and for which each coefficient $a_i$ is in $\mathbb{N}$. Then there is an $n \in \mathbb{N}$ for which the
integer $f(n)$ is not prime.

Proof. (Outline) Because the coefficients are all positive and the degree is at least
1, we have $f(1) > 1$. Let $b = f(1) > 1$. The polynomial $f(x) - b$ has a root 1, so
$f(x) - b = (x-1)g(x)$ for some polynomial $g$. Then $f(x) = (x-1)g(x) + b$. Note that
$f(b+1) = bg(b+1) + b = b(g(b+1) + 1)$. If we can show that $g(b+1) + 1$ is an integer
greater than 1, then we have a nontrivial factoring $f(b+1) = b(g(b+1) + 1)$, so
$f(b+1)$ is not prime. To complete the proof, use the fact that $f(x) - b = (x-1)g(x)$ has
integer coefficients, and deduce that $g(x)$ must also have integer coefficients.


35. The converse is false. The number $n = 11$ is a counterexample: It is prime, but
$2^{11} - 1 = 2047 = 23 \cdot 89$ is not prime. (See also the table on page 166.)


Chapter 10 Exercises 


1. Prove that $1 + 2 + 3 + 4 + \cdots + n = \frac{n^2 + n}{2}$ for every integer $n \in \mathbb{N}$.

Proof. We will prove this with mathematical induction.

(1) Observe that if $n = 1$, this statement is $1 = \frac{1^2 + 1}{2}$, which is obviously true.




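(2) Consider any integer $k \ge 1$. We must show that $S_k$ implies $S_{k+1}$. In other
words, we must show that if $1 + 2 + 3 + 4 + \cdots + k = \frac{k^2 + k}{2}$ is true, then

\[ 1 + 2 + 3 + 4 + \cdots + k + (k+1) = \frac{(k+1)^2 + (k+1)}{2} \]

is also true. We use direct proof.
Suppose $k \ge 1$ and $1 + 2 + 3 + 4 + \cdots + k = \frac{k^2 + k}{2}$. Observe that

\begin{align*}
1 + 2 + 3 + 4 + \cdots + k + (k+1) &= (1 + 2 + 3 + 4 + \cdots + k) + (k+1) \\
&= \frac{k^2 + k}{2} + (k+1) = \frac{k^2 + k + 2(k+1)}{2} \\
&= \frac{k^2 + 2k + 1 + k + 1}{2} \\
&= \frac{(k+1)^2 + (k+1)}{2}.
\end{align*}

Therefore we have shown that $1 + 2 + 3 + 4 + \cdots + k + (k+1) = \frac{(k+1)^2 + (k+1)}{2}$. ∎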


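3. Prove that $1^3 + 2^3 + 3^3 + 4^3 + \cdots + n^3 = \frac{n^2(n+1)^2}{4}$ for every positive integer $n$.

Proof. We will prove this with mathematical induction.

(1) When $n = 1$ the statement is $1^3 = \frac{1^2(1+1)^2}{4} = \frac{4}{4} = 1$, and this is true.

(2) Now assume the statement is true for some integer $n = k \ge 1$, that is assume
$1^3 + 2^3 + 3^3 + 4^3 + \cdots + k^3 = \frac{k^2(k+1)^2}{4}$. Observe that this implies the statement
is true for $n = k + 1$:

\begin{align*}
1^3 + 2^3 + 3^3 + 4^3 + \cdots + k^3 + (k+1)^3 &= (1^3 + 2^3 + 3^3 + 4^3 + \cdots + k^3) + (k+1)^3 \\
&= \frac{k^2(k+1)^2}{4} + (k+1)^3 \\
&= \frac{k^2(k+1)^2}{4} + \frac{4(k+1)^3}{4} \\
&= \frac{k^2(k+1)^2 + 4(k+1)^3}{4} \\
&= \frac{(k+1)^2(k^2 + 4(k+1)^1)}{4} \\
&= \frac{(k+1)^2(k^2 + 4k + 4)}{4} \\
&= \frac{(k+1)^2(k+2)^2}{4} \\
&= \frac{(k+1)^2((k+1)+1)^2}{4}.
\end{align*}

Therefore $1^3 + 2^3 + 3^3 + 4^3 + \cdots + k^3 + (k+1)^3 = \frac{(k+1)^2((k+1)+1)^2}{4}$, which means the
statement is true for $n = k + 1$. ∎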




5. If $n \in \mathbb{N}$, then $2^1 + 2^2 + 2^3 + \cdots + 2^n = 2^{n+1} - 2$.

Proof. The proof is by mathematical induction.

(1) When $n = 1$, this statement is $2^1 = 2^{1+1} - 2$, or $2 = 4 - 2$, which is true.

(2) Now assume the statement is true for some integer $n = k \ge 1$, that is assume
$2^1 + 2^2 + 2^3 + \cdots + 2^k = 2^{k+1} - 2$. Observe this implies that the statement is
true for $n = k + 1$, as follows:

\begin{align*}
2^1 + 2^2 + 2^3 + \cdots + 2^k + 2^{k+1} &= (2^1 + 2^2 + 2^3 + \cdots + 2^k) + 2^{k+1} \\
&= 2^{k+1} - 2 + 2^{k+1} = 2 \cdot 2^{k+1} - 2 \\
&= 2^{k+2} - 2 \\
&= 2^{(k+1)+1} - 2.
\end{align*}

Thus we have $2^1 + 2^2 + 2^3 + \cdots + 2^k + 2^{k+1} = 2^{(k+1)+1} - 2$, so the statement is true
for $n = k + 1$.
Thus the result follows by mathematical induction. ∎
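7. If $n \in \mathbb{N}$, then $1 \cdot 3 + 2 \cdot 4 + 3 \cdot 5 + 4 \cdot 6 + \cdots + n(n+2) = \frac{n(n+1)(2n+7)}{6}$.

Proof. The proof is by mathematical induction.

(1) When $n = 1$, we have $1 \cdot 3 = \frac{1(1+1)(2+7)}{6}$, which is the true statement $3 = \frac{18}{6}$.

(2) Now assume the statement is true for some integer $n = k \ge 1$, that is assume
$1 \cdot 3 + 2 \cdot 4 + 3 \cdot 5 + 4 \cdot 6 + \cdots + k(k+2) = \frac{k(k+1)(2k+7)}{6}$. Now observe that

\begin{align*}
1 \cdot 3 + 2 \cdot 4 + \cdots + k(k+2) + (k+1)((k+1)+2) &= (1 \cdot 3 + 2 \cdot 4 + \cdots + k(k+2)) + (k+1)((k+1)+2) \\
&= \frac{k(k+1)(2k+7)}{6} + (k+1)((k+1)+2) \\
&= \frac{k(k+1)(2k+7)}{6} + \frac{6(k+1)(k+3)}{6} \\
&= \frac{k(k+1)(2k+7) + 6(k+1)(k+3)}{6} \\
&= \frac{(k+1)(k(2k+7) + 6(k+3))}{6} \\
&= \frac{(k+1)(2k^2 + 13k + 18)}{6} \\
&= \frac{(k+1)(k+2)(2k+9)}{6} \\
&= \frac{(k+1)((k+1)+1)(2(k+1)+7)}{6}.
\end{align*}

Thus we have $1 \cdot 3 + 2 \cdot 4 + 3 \cdot 5 + 4 \cdot 6 + \cdots + k(k+2) + (k+1)((k+1)+2) = \frac{(k+1)((k+1)+1)(2(k+1)+7)}{6}$,
and this means the statement is true for $n = k + 1$.
Thus the result follows by mathematical induction. ∎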
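
A closed form proved by induction can also be compared against a direct sum for small n. The Python sketch below does this for the formula of this exercise; the names `direct_sum` and `closed_form` are hypothetical helpers, and agreement on finitely many values illustrates the result without proving it.

```python
def direct_sum(n):
    """Compute 1*3 + 2*4 + ... + n(n+2) term by term."""
    return sum(i * (i + 2) for i in range(1, n + 1))

def closed_form(n):
    """Evaluate n(n+1)(2n+7)/6 in exact integer arithmetic."""
    return n * (n + 1) * (2 * n + 7) // 6

for n in (1, 2, 3, 10):
    print(n, direct_sum(n), closed_form(n))
```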




9. Prove that $24 \mid (5^{2n}-1)$ for any integer $n \ge 0$.

Proof. The proof is by mathematical induction.

(1) For $n = 0$, the statement is $24 \mid (5^{2\cdot 0}-1)$. This is $24 \mid 0$, which is true.

(2) Now assume the statement is true for some integer $n = k \ge 0$, that is, assume $24 \mid (5^{2k}-1)$. This means $5^{2k}-1 = 24a$ for some integer $a$, and from this we get $5^{2k} = 24a+1$. Now observe that

\[
\begin{aligned}
5^{2(k+1)}-1 &= 5^{2k+2}-1 \\
&= 5^2\,5^{2k}-1 \\
&= 5^2(24a+1)-1 \\
&= 25(24a+1)-1 \\
&= 25\cdot 24a+25-1 = 24(25a+1).
\end{aligned}
\]

This shows $5^{2(k+1)}-1 = 24(25a+1)$, which means $24 \mid (5^{2(k+1)}-1)$.
This completes the proof by mathematical induction. ∎
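
The inductive step above turns the witness a for n = k into the witness 25a + 1 for n = k + 1. The sketch below, which is illustrative only and uses the hypothetical helper name `step_holds`, checks both the divisibility claim and that witness relation for small k.

```python
def step_holds(k):
    """Check 24 | 5^(2k) - 1 and the identity 5^(2(k+1)) - 1 = 24(25a + 1)."""
    a, remainder = divmod(5 ** (2 * k) - 1, 24)
    if remainder != 0:
        return False  # the divisibility claim itself would fail
    return 5 ** (2 * (k + 1)) - 1 == 24 * (25 * a + 1)


all_steps_hold = all(step_holds(k) for k in range(50))
print(all_steps_hold)
```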


11. Prove that $3 \mid (n^3+5n+6)$ for any integer $n \ge 0$.

Proof. The proof is by mathematical induction.

(1) When $n = 0$, the statement is $3 \mid (0^3+5\cdot 0+6)$, or $3 \mid 6$, which is true.

(2) Now assume the statement is true for some integer $n = k \ge 0$, that is, assume $3 \mid (k^3+5k+6)$. This means $k^3+5k+6 = 3a$ for some integer $a$. We need to show that $3 \mid ((k+1)^3+5(k+1)+6)$. Observe that

\[
\begin{aligned}
(k+1)^3+5(k+1)+6 &= k^3+3k^2+3k+1+5k+5+6 \\
&= (k^3+5k+6)+3k^2+3k+6 \\
&= 3a+3k^2+3k+6 \\
&= 3(a+k^2+k+2).
\end{aligned}
\]

Thus we have deduced $(k+1)^3+5(k+1)+6 = 3(a+k^2+k+2)$. Since $a+k^2+k+2$ is an integer, it follows that $3 \mid ((k+1)^3+5(k+1)+6)$.
It follows by mathematical induction that $3 \mid (n^3+5n+6)$ for every $n \ge 0$. ∎


13. Prove that $6 \mid (n^3-n)$ for any integer $n \ge 0$.

Proof. The proof is by mathematical induction.
(1) When $n = 0$, the statement is $6 \mid (0^3-0)$, or $6 \mid 0$, which is true.




(2) Now assume the statement is true for some integer $n = k \ge 0$, that is, assume $6 \mid (k^3-k)$. This means $k^3-k = 6a$ for some integer $a$. We need to show that $6 \mid ((k+1)^3-(k+1))$. Observe that

\[
\begin{aligned}
(k+1)^3-(k+1) &= k^3+3k^2+3k+1-k-1 \\
&= (k^3-k)+3k^2+3k \\
&= 6a+3k^2+3k \\
&= 6a+3k(k+1).
\end{aligned}
\]

Thus we have deduced $(k+1)^3-(k+1) = 6a+3k(k+1)$. Since one of $k$ or $(k+1)$ must be even, it follows that $k(k+1)$ is even, so $k(k+1) = 2b$ for some integer $b$. Consequently $(k+1)^3-(k+1) = 6a+3k(k+1) = 6a+3(2b) = 6(a+b)$. Since $(k+1)^3-(k+1) = 6(a+b)$ it follows that $6 \mid ((k+1)^3-(k+1))$.

Thus the result follows by mathematical induction. ∎
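15. If $n \in \mathbb{N}$, then $\frac{1}{1\cdot 2}+\frac{1}{2\cdot 3}+\frac{1}{3\cdot 4}+\frac{1}{4\cdot 5}+\cdots+\frac{1}{n(n+1)} = 1-\frac{1}{n+1}$.

Proof. The proof is by mathematical induction.

(1) When $n = 1$, the statement is $\frac{1}{1(1+1)} = 1-\frac{1}{1+1}$, which simplifies to $\frac12 = \frac12$.

(2) Now assume the statement is true for some integer $n = k \ge 1$, that is, assume $\frac{1}{1\cdot 2}+\frac{1}{2\cdot 3}+\frac{1}{3\cdot 4}+\frac{1}{4\cdot 5}+\cdots+\frac{1}{k(k+1)} = 1-\frac{1}{k+1}$. Next we show that the statement for $n = k+1$ is true. Observe that

\[
\begin{aligned}
\frac{1}{1\cdot 2}+\frac{1}{2\cdot 3}+\cdots+\frac{1}{k(k+1)}+\frac{1}{(k+1)((k+1)+1)}
&= \left(\frac{1}{1\cdot 2}+\frac{1}{2\cdot 3}+\cdots+\frac{1}{k(k+1)}\right)+\frac{1}{(k+1)(k+2)} \\
&= \left(1-\frac{1}{k+1}\right)+\frac{1}{(k+1)(k+2)} \\
&= 1-\frac{1}{k+1}+\frac{1}{(k+1)(k+2)} \\
&= 1-\frac{k+2}{(k+1)(k+2)}+\frac{1}{(k+1)(k+2)} \\
&= 1-\frac{k+1}{(k+1)(k+2)} \\
&= 1-\frac{1}{k+2} \\
&= 1-\frac{1}{(k+1)+1}.
\end{aligned}
\]

This establishes $\frac{1}{1\cdot 2}+\frac{1}{2\cdot 3}+\frac{1}{3\cdot 4}+\cdots+\frac{1}{(k+1)((k+1)+1)} = 1-\frac{1}{(k+1)+1}$, which is to say that the statement is true for $n = k+1$.

This completes the proof by mathematical induction. ∎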


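17. Suppose $A_1, A_2, \ldots, A_n$ are sets in some universal set $U$, and $n \ge 2$. Prove that $\overline{A_1\cap A_2\cap\cdots\cap A_n} = \overline{A_1}\cup\overline{A_2}\cup\cdots\cup\overline{A_n}$.

Proof. The proof is by strong induction.
(1) When $n = 2$ the statement is $\overline{A_1\cap A_2} = \overline{A_1}\cup\overline{A_2}$. This is not an entirely obvious statement, so we have to prove it. Observe that

\[
\begin{aligned}
\overline{A_1\cap A_2} &= \{x : (x\in U)\wedge(x\notin A_1\cap A_2)\} && \text{(definition of complement)} \\
&= \{x : (x\in U)\wedge\sim(x\in A_1\cap A_2)\} \\
&= \{x : (x\in U)\wedge\sim((x\in A_1)\wedge(x\in A_2))\} && \text{(definition of } \cap) \\
&= \{x : (x\in U)\wedge(\sim(x\in A_1)\vee\sim(x\in A_2))\} && \text{(DeMorgan)} \\
&= \{x : (x\in U)\wedge((x\notin A_1)\vee(x\notin A_2))\} \\
&= \{x : ((x\in U)\wedge(x\notin A_1))\vee((x\in U)\wedge(x\notin A_2))\} && \text{(distributive prop.)} \\
&= \{x : (x\in U)\wedge(x\notin A_1)\}\cup\{x : (x\in U)\wedge(x\notin A_2)\} && \text{(def. of } \cup) \\
&= \overline{A_1}\cup\overline{A_2} && \text{(definition of complement).}
\end{aligned}
\]

(2) Let $k \ge 2$. Assume the statement is true if it involves $k$ or fewer sets. Then

\[
\begin{aligned}
\overline{A_1\cap A_2\cap\cdots\cap A_{k-1}\cap A_k\cap A_{k+1}}
&= \overline{A_1\cap A_2\cap\cdots\cap A_{k-1}\cap(A_k\cap A_{k+1})} \\
&= \overline{A_1}\cup\overline{A_2}\cup\cdots\cup\overline{A_{k-1}}\cup\overline{A_k\cap A_{k+1}} \\
&= \overline{A_1}\cup\overline{A_2}\cup\cdots\cup\overline{A_{k-1}}\cup\overline{A_k}\cup\overline{A_{k+1}}.
\end{aligned}
\]

Thus the statement is true when it involves $k+1$ sets.
This completes the proof by strong induction. ∎

19. Prove $\sum_{k=1}^{n}\frac{1}{k^2} \le 2-\frac{1}{n}$ for every $n$.

Proof. This clearly holds for $n = 1$. Assume it holds for some $n \ge 1$. Then

\[
\sum_{k=1}^{n+1}\frac{1}{k^2} \le 2-\frac{1}{n}+\frac{1}{(n+1)^2} = 2-\frac{(n+1)^2-n}{n(n+1)^2} = 2-\frac{n^2+n+1}{n(n+1)^2} \le 2-\frac{n^2+n}{n(n+1)^2} = 2-\frac{1}{n+1}.
\]
∎

21. If $n \in \mathbb{N}$, then $\frac11+\frac12+\frac13+\cdots+\frac{1}{2^n} \ge 1+\frac{n}{2}$.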


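Proof. If $n = 1$, the result is obvious.
Assume the proposition holds for some $n \ge 1$. Then

\[
\begin{aligned}
\frac11+\frac12+\frac13+\cdots+\frac{1}{2^{n+1}}
&= \left(\frac11+\frac12+\frac13+\cdots+\frac{1}{2^n}\right)+\left(\frac{1}{2^n+1}+\frac{1}{2^n+2}+\frac{1}{2^n+3}+\cdots+\frac{1}{2^{n+1}}\right) \\
&\ge \left(1+\frac{n}{2}\right)+\left(\frac{1}{2^n+1}+\frac{1}{2^n+2}+\frac{1}{2^n+3}+\cdots+\frac{1}{2^{n+1}}\right).
\end{aligned}
\]

Now, the sum $\left(\frac{1}{2^n+1}+\frac{1}{2^n+2}+\frac{1}{2^n+3}+\cdots+\frac{1}{2^{n+1}}\right)$ on the right has $2^{n+1}-2^n = 2^n$ terms, all greater than or equal to $\frac{1}{2^{n+1}}$, so the sum is greater than $2^n\cdot\frac{1}{2^{n+1}} = \frac12$. Therefore we get

\[
\frac11+\frac12+\frac13+\cdots+\frac{1}{2^{n+1}} \ge \left(1+\frac{n}{2}\right)+\left(\frac{1}{2^n+1}+\frac{1}{2^n+2}+\cdots+\frac{1}{2^{n+1}}\right) \ge \left(1+\frac{n}{2}\right)+\frac12 = 1+\frac{n+1}{2}.
\]

This means the result is true for $n+1$, so the theorem is proved. ∎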


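23. Use induction to prove the binomial theorem $(x+y)^n = \sum_{i=0}^{n}\binom{n}{i}x^{n-i}y^i$.

Proof. Notice that when $n = 1$, the formula is $(x+y)^1 = \binom{1}{0}x^1y^0+\binom{1}{1}x^0y^1 = x+y$, which is true.

Now assume the theorem is true for some $n \ge 1$. We will show that this implies that it is true for the power $n+1$. Just observe that (using the convention $\binom{n}{-1} = 0$)

\[
\begin{aligned}
(x+y)^{n+1} &= (x+y)(x+y)^n \\
&= (x+y)\sum_{i=0}^{n}\binom{n}{i}x^{n-i}y^i \\
&= \sum_{i=0}^{n}\binom{n}{i}x^{(n+1)-i}y^i+\sum_{i=0}^{n}\binom{n}{i}x^{n-i}y^{i+1} \\
&= \sum_{i=0}^{n}\left[\binom{n}{i}+\binom{n}{i-1}\right]x^{(n+1)-i}y^i+y^{n+1} \\
&= \sum_{i=0}^{n}\binom{n+1}{i}x^{(n+1)-i}y^i+\binom{n+1}{n+1}y^{n+1} \\
&= \sum_{i=0}^{n+1}\binom{n+1}{i}x^{(n+1)-i}y^i.
\end{aligned}
\]

This shows that the formula is true for $(x+y)^{n+1}$, so the theorem is proved. ∎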
25. Concerning the Fibonacci sequence, prove that $F_1+F_2+F_3+F_4+\cdots+F_n = F_{n+2}-1$.

Proof. The proof is by induction.

(1) When $n = 1$ the statement is $F_1 = F_{1+2}-1 = F_3-1 = 2-1 = 1$, which is true. Also when $n = 2$ the statement is $F_1+F_2 = F_{2+2}-1 = F_4-1 = 3-1 = 2$, which is true, as $F_1+F_2 = 1+1 = 2$.

(2) Now assume $k \ge 1$ and $F_1+F_2+F_3+F_4+\cdots+F_k = F_{k+2}-1$. We need to show $F_1+F_2+F_3+F_4+\cdots+F_k+F_{k+1} = F_{k+3}-1$. Observe that

\[
\begin{aligned}
F_1+F_2+F_3+F_4+\cdots+F_k+F_{k+1} &= (F_1+F_2+F_3+F_4+\cdots+F_k)+F_{k+1} \\
&= F_{k+2}-1+F_{k+1} = (F_{k+1}+F_{k+2})-1 \\
&= F_{k+3}-1.
\end{aligned}
\]

This completes the proof by induction. ∎
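
As a sanity check on the identity in Exercise 25, the sketch below computes Fibonacci numbers directly and compares the running sum with F_{n+2} - 1. It is illustrative only; the helper `fib` is a hypothetical name and assumes the book's indexing F_1 = F_2 = 1.

```python
def fib(n):
    """Return F_n with F_1 = F_2 = 1 (and F_0 = 0)."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a


def sum_identity_holds(n):
    """Check F_1 + F_2 + ... + F_n == F_(n+2) - 1."""
    return sum(fib(i) for i in range(1, n + 1)) == fib(n + 2) - 1


fibonacci_sum_checked = all(sum_identity_holds(n) for n in range(1, 60))
print(fibonacci_sum_checked)
```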


27. Concerning the Fibonacci sequence, prove that $F_1+F_3+\cdots+F_{2n-1} = F_{2n}$.

Proof. If $n = 1$, the result is clear. Assume for some $n \ge 1$ we have $\sum_{i=1}^{n}F_{2i-1} = F_{2n}$.
Then $\sum_{i=1}^{n+1}F_{2i-1} = F_{2n+1}+\sum_{i=1}^{n}F_{2i-1} = F_{2n+1}+F_{2n} = F_{2n+2} = F_{2(n+1)}$ as desired. ∎




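29. Notice that the sum of elements on the $n$th diagonal has the form

\[
\binom{n}{0}+\binom{n-1}{1}+\binom{n-2}{2}+\binom{n-3}{3}+\cdots+\binom{1}{n-1}+\binom{0}{n}.
\]

(For example, $\binom{6}{0}+\binom{5}{1}+\binom{4}{2}+\binom{3}{3}+\binom{2}{4}+\binom{1}{5}+\binom{0}{6} = 13 = F_{6+1}$.)
Therefore, we need to prove that $\binom{n}{0}+\binom{n-1}{1}+\binom{n-2}{2}+\cdots+\binom{1}{n-1}+\binom{0}{n} = F_{n+1}$ for each $n \ge 0$.

Proof. (Strong Induction) For $n = 1$ this is $\binom{1}{0}+\binom{0}{1} = 1+0 = 1 = F_2 = F_{1+1}$. Thus the assertion is true when $n = 1$.

Now fix $n$ and assume that $\binom{k}{0}+\binom{k-1}{1}+\binom{k-2}{2}+\cdots+\binom{1}{k-1}+\binom{0}{k} = F_{k+1}$ whenever $k < n$. In what follows we use the identity $\binom{n}{k} = \binom{n-1}{k-1}+\binom{n-1}{k}$. We also often use $\binom{a}{b} = 0$ whenever it is untrue that $0 \le b \le a$.

\[
\begin{aligned}
&\binom{n}{0}+\binom{n-1}{1}+\binom{n-2}{2}+\cdots+\binom{1}{n-1}+\binom{0}{n} \\
&\quad= \binom{n}{0}+\binom{n-1}{1}+\binom{n-2}{2}+\cdots+\binom{1}{n-1} \\
&\quad= \left[\binom{n-1}{-1}+\binom{n-1}{0}\right]+\left[\binom{n-2}{0}+\binom{n-2}{1}\right]+\left[\binom{n-3}{1}+\binom{n-3}{2}\right]+\cdots+\left[\binom{0}{n-2}+\binom{0}{n-1}\right] \\
&\quad= \left[\binom{n-1}{0}+\binom{n-2}{1}+\cdots+\binom{0}{n-1}\right]+\left[\binom{n-2}{0}+\binom{n-3}{1}+\cdots+\binom{0}{n-2}\right] \\
&\quad= F_n+F_{n-1} = F_{n+1}.
\end{aligned}
\]

This completes the proof. ∎

31. Prove that $\sum_{k=0}^{n}\binom{k}{r} = \binom{n+1}{r+1}$, where $r \in \mathbb{N}$.

Hint: Use induction on $n$. If $n = 0$, the equation is $\binom{0}{r} = \binom{0+1}{r+1}$, which is $0 = 0$.

For the inductive step, we must show that $\sum_{k=0}^{n}\binom{k}{r} = \binom{n+1}{r+1}$ implies $\sum_{k=0}^{n+1}\binom{k}{r} = \binom{(n+1)+1}{r+1}$.

Thus assume $\sum_{k=0}^{n}\binom{k}{r} = \binom{n+1}{r+1}$. By Pascal's formula, $\binom{(n+1)+1}{r+1} = \binom{n+1}{r}+\binom{n+1}{r+1}$. Now use the inductive hypothesis and Pascal's formula again to transform this to $\sum_{k=0}^{n+1}\binom{k}{r}$.

33. Suppose that $n$ infinitely long straight lines lie on the plane in such a way that no two are parallel, and no three intersect at a single point. Show that this arrangement divides the plane into $\frac{n^2+n+2}{2}$ regions.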

Proof. The proof is by induction. For the basis step, suppose $n = 1$. Then there is one line, and it clearly divides the plane into 2 regions, one on either side of the line. As $2 = \frac{1^2+1+2}{2} = \frac{n^2+n+2}{2}$, the formula is correct when $n = 1$.

Now suppose there are $n+1$ lines on the plane, and that the formula is correct for when there are $n$ lines on the plane. Single out one of the $n+1$ lines on the plane, and call it $\ell$. Remove line $\ell$, so that there are now $n$ lines on the plane.

By the induction hypothesis, these $n$ lines divide the plane into $\frac{n^2+n+2}{2}$ regions. Now add line $\ell$ back. Doing this adds an additional $n+1$ regions. (The diagram illustrates the case where $n+1 = 5$. Without $\ell$, there are $n = 4$ lines. Adding $\ell$ back produces $n+1 = 5$ new regions.)

[Diagram omitted.]

Thus, with $n+1$ lines there are all together $(n+1)+\frac{n^2+n+2}{2}$ regions. Observe

\[
(n+1)+\frac{n^2+n+2}{2} = \frac{2n+2+n^2+n+2}{2} = \frac{(n+1)^2+(n+1)+2}{2}.
\]

Thus, with $n+1$ lines, we have $\frac{(n+1)^2+(n+1)+2}{2}$ regions, which means that the formula is true for when there are $n+1$ lines. We have shown that if the formula is true for $n$ lines, it is also true for $n+1$ lines. This completes the proof. ∎

35. If $n, k \in \mathbb{N}$, and $n$ is even and $k$ is odd, then $\binom{n}{k}$ is even.

Proof. Notice that if $k$ is not a value between 0 and $n$, then $\binom{n}{k} = 0$ is even; thus from here on we can assume that $0 < k < n$. We will use strong induction.

For the basis case, notice that the assertion is true for the even values $n = 2$ and $n = 4$: $\binom{2}{1} = 2$; $\binom{4}{1} = 4$; $\binom{4}{3} = 4$ (even in each case).

Now fix an even $n$ and assume that $\binom{m}{k}$ is even whenever $m$ is even, $k$ is odd, and $m < n$. Using the identity $\binom{n}{k} = \binom{n-1}{k-1}+\binom{n-1}{k}$ three times, we get

\[
\begin{aligned}
\binom{n}{k} &= \binom{n-1}{k-1}+\binom{n-1}{k} \\
&= \binom{n-2}{k-2}+\binom{n-2}{k-1}+\binom{n-2}{k-1}+\binom{n-2}{k} \\
&= \binom{n-2}{k-2}+2\binom{n-2}{k-1}+\binom{n-2}{k}.
\end{aligned}
\]

Now, $n-2$ is even, and $k$ and $k-2$ are odd. By the inductive hypothesis, the outer terms of the above expression are even, and the middle is clearly even; thus we have expressed $\binom{n}{k}$ as the sum of three even integers, so it is even. ∎

37. Prove that if $m, n \in \mathbb{N}$, then $\sum_{k=0}^{n}k\binom{m+k}{m} = n\binom{m+n+1}{m+1}-\binom{m+n+1}{m+2}$.


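Proof. We will use induction on $n$. Let $m$ be any integer.
(1) If $n = 1$, then the equation is $\sum_{k=0}^{1}k\binom{m+k}{m} = 1\binom{m+1+1}{m+1}-\binom{m+1+1}{m+2}$, and this is $0\binom{m}{m}+1\binom{m+1}{m} = 1\binom{m+2}{m+1}-\binom{m+2}{m+2}$, which yields the true statement $m+1 = m+2-1$.

(2) Now let $n \ge 1$ and assume the equation holds for $n$. (This is the inductive hypothesis.) Now we will confirm that it holds for $n+1$. Observe that

\[
\begin{aligned}
\sum_{k=0}^{n+1}k\binom{m+k}{m}
&= \sum_{k=0}^{n}k\binom{m+k}{m}+(n+1)\binom{m+(n+1)}{m} && \text{(split off final term)} \\
&= n\binom{m+n+1}{m+1}-\binom{m+n+1}{m+2}+(n+1)\binom{m+n+1}{m} && \text{(apply inductive hypothesis)} \\
&= n\binom{m+n+1}{m+1}+\binom{m+n+1}{m+1}-\binom{m+n+2}{m+2}+(n+1)\binom{m+n+1}{m} && \text{(Pascal's formula)} \\
&= (n+1)\binom{m+n+1}{m+1}-\binom{m+n+2}{m+2}+(n+1)\binom{m+n+1}{m} && \text{(factor)} \\
&= (n+1)\left[\binom{m+n+1}{m+1}+\binom{m+n+1}{m}\right]-\binom{m+n+2}{m+2} && \text{(factor again)} \\
&= (n+1)\binom{m+n+2}{m+1}-\binom{m+n+2}{m+2} && \text{(Pascal's formula)} \\
&= (n+1)\binom{m+(n+1)+1}{m+1}-\binom{m+(n+1)+1}{m+2}.
\end{aligned}
\]

The first expression is the left-hand side for $n+1$ and the last is the right-hand side for $n+1$, so the proof is finished. ∎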
m 


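39. Prove that $\sum_{k=0}^{m}\binom{m}{k}\binom{n}{p+k} = \binom{m+n}{m+p}$ for non-negative integers $m$, $n$ and $p$.

Proof. We will use induction on $n$. Let $m$ and $p$ be any non-negative integers.
(1) If $n = 0$, then the equation is $\sum_{k=0}^{m}\binom{m}{k}\binom{0}{p+k} = \binom{m+0}{m+p}$. This holds if $p > 0$, because then $\binom{0}{p+k} = 0 = \binom{m}{m+p}$, and both sides of the equation are zero. If $p = 0$, the equation is $\sum_{k=0}^{m}\binom{m}{k}\binom{0}{k} = \binom{m}{m}$, and both sides equal 1.
(2) Now take $n \ge 0$ and suppose the equation holds for $n$. (This is the inductive hypothesis.) Next we confirm that the equation holds for $n+1$.

\[
\begin{aligned}
\binom{m+(n+1)}{m+p}
&= \binom{m+n}{m+(p-1)}+\binom{m+n}{m+p} && \text{(Pascal's formula)} \\
&= \sum_{k=0}^{m}\binom{m}{k}\binom{n}{(p-1)+k}+\sum_{k=0}^{m}\binom{m}{k}\binom{n}{p+k} && \text{(apply inductive hypothesis)} \\
&= \sum_{k=0}^{m}\binom{m}{k}\left[\binom{n}{(p-1)+k}+\binom{n}{p+k}\right] && \text{(combine)} \\
&= \sum_{k=0}^{m}\binom{m}{k}\binom{n+1}{p+k} && \text{(Pascal's formula)}
\end{aligned}
\]

The first expression is the right-hand side for $n+1$ and this final expression is the left-hand side for $n+1$, so the proof is finished. ∎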




41. If $n$ and $k$ are non-negative integers, then $\binom{n+0}{0}+\binom{n+1}{1}+\binom{n+2}{2}+\cdots+\binom{n+k}{k} = \binom{n+k+1}{k}$.

Proof. We will use induction on $k$. Let $n$ be any non-negative integer.
(1) If $k = 0$, then the equation is $\binom{n+0}{0} = \binom{n+0+1}{0}$, which reduces to $1 = 1$.

(2) Assume the equation holds for some $k \ge 0$. (This is the inductive hypothesis.) Now we will show that it holds for $k+1$. Note that

\[
\begin{aligned}
\binom{n+0}{0}+\binom{n+1}{1}+\binom{n+2}{2}+\cdots+\binom{n+k}{k}+\binom{n+(k+1)}{k+1}
&= \binom{n+k+1}{k}+\binom{n+k+1}{k+1} && \text{(apply inductive hypothesis)} \\
&= \binom{n+k+2}{k+1} && \text{(Pascal's formula)} \\
&= \binom{n+(k+1)+1}{k+1}.
\end{aligned}
\]

The first expression is the left side for $k+1$ and the last is the right-hand side for $k+1$. The proof is complete. ∎
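
Binomial identities like the one in Exercise 41 are easy to spot-check numerically before (or after) proving them. The sketch below is illustrative only; it relies on `math.comb` from the Python standard library (available from Python 3.8), and the function name `identity_41_holds` is a hypothetical helper.

```python
from math import comb


def identity_41_holds(n, k):
    """Check C(n+0,0) + C(n+1,1) + ... + C(n+k,k) == C(n+k+1,k)."""
    left = sum(comb(n + j, j) for j in range(k + 1))
    right = comb(n + k + 1, k)
    return left == right


# Try every pair of non-negative integers n, k below 30.
identity_checked = all(
    identity_41_holds(n, k) for n in range(30) for k in range(30)
)
print(identity_checked)
```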


Chapter 11 Exercises 
Section 11.1 


1. Let $A = \{0,1,2,3,4,5\}$. Write out the relation $R$ that expresses $>$ on $A$. Then illustrate it with a diagram.

R = {(5,4), (5,3), (5,2), (5,1), (5,0), (4,3), (4,2), (4,1),
(4,0), (3,2), (3,1), (3,0), (2,1), (2,0), (1,0)}

[Diagram omitted.]


3. Let $A = \{0,1,2,3,4,5\}$. Write out the relation $R$ that expresses $\ge$ on $A$. Then illustrate it with a diagram.

R = {(5,5), (5,4), (5,3), (5,2), (5,1), (5,0),
(4,4), (4,3), (4,2), (4,1), (4,0),
(3,3), (3,2), (3,1), (3,0),
(2,2), (2,1), (2,0), (1,1), (1,0), (0,0)}

5. Write the sets $A$ and $R$ for the diagramed relation. Answer: $A = \{0,1,2,3,4,5\}$;
$R = \{(3,3), (4,3), (4,2), (1,2), (2,5), (5,0)\}$

7. Write the relation $<$ on the set $A = \mathbb{Z}$ as a subset $R$ of $\mathbb{Z}\times\mathbb{Z}$. This is an infinite set, so you will have to use set-builder notation.
Answer: $R = \{(x,y) \in \mathbb{Z}\times\mathbb{Z} : y-x \in \mathbb{N}\}$


9. How many different relations are there on the set $A = \{1,2,3,4,5,6\}$?

Consider forming a relation $R \subseteq A\times A$ on $A$. For each ordered pair $(x,y) \in A\times A$, we have two choices: we can either include $(x,y)$ in $R$ or not include it. There are $6\cdot 6 = 36$ ordered pairs in $A\times A$. By the multiplication principle, there are thus $2^{36}$ different subsets $R$ and hence also this many relations on $A$.

11. Answer: $2^{(|A|^2)}$    13. Answer: $\ne$    15. Answer: $\equiv \pmod 3$


Section 11.2 


1. Consider the relation $R = \{(a,a),(b,b),(c,c),(d,d),(a,b),(b,a)\}$ on the set $A = \{a,b,c,d\}$. Which of the properties reflexive, symmetric and transitive does $R$ possess and why? If a property does not hold, say why.

This is reflexive because $(x,x) \in R$ (i.e., $xRx$) for every $x \in A$.

It is symmetric because it is impossible to find an $(x,y) \in R$ for which $(y,x) \notin R$.
It is transitive because $(xRy \wedge yRz) \Rightarrow xRz$ always holds.


3. Consider the relation $R = \{(a,b),(a,c),(c,b),(b,c)\}$ on the set $A = \{a,b,c\}$. Which of the properties reflexive, symmetric and transitive does $R$ possess and why? If a property does not hold, say why.

This is not reflexive because $(a,a) \notin R$ (for example).

It is not symmetric because $(a,b) \in R$ but $(b,a) \notin R$.

It is not transitive because $cRb$ and $bRc$ are true, but $cRc$ is false.


5. Consider the relation $R = \{(0,0),(\sqrt2,0),(0,\sqrt2),(\sqrt2,\sqrt2)\}$ on $\mathbb{R}$. Say whether this relation is reflexive, symmetric and transitive. If a property does not hold, say why.

This is not reflexive because $(1,1) \notin R$ (for example).

It is symmetric because it is impossible to find an $(x,y) \in R$ for which $(y,x) \notin R$.
It is transitive because $(xRy \wedge yRz) \Rightarrow xRz$ always holds.


7. There are 16 possible different relations $R$ on the set $A = \{a,b\}$. Describe all of them. (A picture for each one will suffice, but don't forget to label the nodes.) Which ones are reflexive? Symmetric? Transitive?

[Diagram omitted: the sixteen relations on $\{a,b\}$, drawn in a grid of four rows and four columns.]

Only the four in the right column are reflexive. Only the eight in the first and fourth rows are symmetric. All of them are transitive except the first three on the fourth row.
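
The counts implied by this answer (4 reflexive, 8 symmetric and 16 - 3 = 13 transitive relations) can be confirmed by brute force, since a relation on {a, b} is just a subset of the four ordered pairs. The following is a minimal sketch under that reading; the function names are hypothetical helpers, not notation from the text.

```python
from itertools import combinations, product

A = ["a", "b"]
pairs = list(product(A, repeat=2))  # the 4 ordered pairs in A x A


def all_relations(pairs):
    """Yield every subset of A x A as a frozenset of ordered pairs."""
    for size in range(len(pairs) + 1):
        for subset in combinations(pairs, size):
            yield frozenset(subset)


def is_reflexive(R):
    return all((x, x) in R for x in A)


def is_symmetric(R):
    return all((y, x) in R for (x, y) in R)


def is_transitive(R):
    # Whenever (x, y) and (y, w) are in R, (x, w) must be in R too.
    return all((x, w) in R for (x, y) in R for (z, w) in R if y == z)


relations = list(all_relations(pairs))
counts = (
    len(relations),
    sum(map(is_reflexive, relations)),
    sum(map(is_symmetric, relations)),
    sum(map(is_transitive, relations)),
)
print(counts)  # (16, 4, 8, 13)
```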


9. Define a relation on $\mathbb{Z}$ by declaring $xRy$ if and only if $x$ and $y$ have the same parity. Say whether this relation is reflexive, symmetric and transitive. If a property does not hold, say why. What familiar relation is this?

This is reflexive because $xRx$ since $x$ always has the same parity as $x$.

It is symmetric because if $x$ and $y$ have the same parity, then $y$ and $x$ must have the same parity (that is, $xRy \Rightarrow yRx$).

It is transitive because if $x$ and $y$ have the same parity and $y$ and $z$ have the same parity, then $x$ and $z$ must have the same parity. (That is, $(xRy \wedge yRz) \Rightarrow xRz$ always holds.)

The relation is congruence modulo 2.

11. Suppose $A = \{a,b,c,d\}$ and $R = \{(a,a),(b,b),(c,c),(d,d)\}$. Say whether this relation is reflexive, symmetric and transitive. If a property does not hold, say why.
This is reflexive because $(x,x) \in R$ for every $x \in A$.

It is symmetric because it is impossible to find an $(x,y) \in R$ for which $(y,x) \notin R$.
It is transitive because $(xRy \wedge yRz) \Rightarrow xRz$ always holds.

(For example $(aRa \wedge aRa) \Rightarrow aRa$ is true, etc.)

13. Consider the relation $R = \{(x,y) \in \mathbb{R}\times\mathbb{R} : x-y \in \mathbb{Z}\}$ on $\mathbb{R}$. Prove that this relation is reflexive, symmetric, and transitive.

Proof. In this relation, $xRy$ means $x-y \in \mathbb{Z}$.

To see that $R$ is reflexive, take any $x \in \mathbb{R}$ and observe that $x-x = 0 \in \mathbb{Z}$, so $xRx$. Therefore $R$ is reflexive.

To see that $R$ is symmetric, we need to prove $xRy \Rightarrow yRx$ for all $x,y \in \mathbb{R}$. We use direct proof. Suppose $xRy$. This means $x-y \in \mathbb{Z}$. Then it follows that $-(x-y) = y-x$ is also in $\mathbb{Z}$. But $y-x \in \mathbb{Z}$ means $yRx$. We've shown $xRy$ implies $yRx$, so $R$ is symmetric.

To see that $R$ is transitive, we need to prove $(xRy \wedge yRz) \Rightarrow xRz$ is always true. We prove this conditional statement with direct proof. Suppose $xRy$ and $yRz$. Since $xRy$, we know $x-y \in \mathbb{Z}$. Since $yRz$, we know $y-z \in \mathbb{Z}$. Thus $x-y$ and $y-z$ are both integers; by adding these integers we get another integer $(x-y)+(y-z) = x-z$. Thus $x-z \in \mathbb{Z}$, and this means $xRz$. We've now shown that if $xRy$ and $yRz$, then $xRz$. Therefore $R$ is transitive. ∎

15. Prove or disprove: If a relation is symmetric and transitive, then it is also reflexive.

This is false. For a counterexample, consider the relation $R = \{(a,a),(a,b),(b,a),(b,b)\}$ on the set $A = \{a,b,c\}$. This is symmetric and transitive but it is not reflexive, because $(c,c) \notin R$.

17. Define a relation $\sim$ on $\mathbb{Z}$ as $x \sim y$ if and only if $|x-y| \le 1$. Say whether $\sim$ is reflexive, symmetric and transitive.

This is reflexive because $|x-x| = 0 \le 1$ for all integers $x$. It is symmetric because $x \sim y$ if and only if $|x-y| \le 1$, if and only if $|y-x| \le 1$, if and only if $y \sim x$. It is not transitive because, for example, $0 \sim 1$ and $1 \sim 2$, but it is not the case that $0 \sim 2$.




Section 11.3 


1. Let $A = \{1,2,3,4,5,6\}$, and consider the following equivalence relation on $A$: $R = \{(1,1), (2,2), (3,3), (4,4), (5,5), (6,6), (2,3), (3,2), (4,5), (5,4), (4,6), (6,4), (5,6), (6,5)\}$. List the equivalence classes of $R$.

The equivalence classes are: $[1] = \{1\}$; $[2] = [3] = \{2,3\}$; $[4] = [5] = [6] = \{4,5,6\}$.

3. Let $A = \{a,b,c,d,e\}$. Suppose $R$ is an equivalence relation on $A$. Suppose $R$ has three equivalence classes. Also $aRd$ and $bRc$. Write out $R$ as a set.

Answer: $R = \{(a,a),(b,b),(c,c),(d,d),(e,e),(a,d),(d,a),(b,c),(c,b)\}$.

5. There are two equivalence relations on the set $A = \{a,b\}$. Describe them.
Answer: $R = \{(a,a),(b,b)\}$ and $R = \{(a,a),(b,b),(a,b),(b,a)\}$

7. Define a relation $R$ on $\mathbb{Z}$ as $xRy$ if and only if $3x-5y$ is even. Prove $R$ is an equivalence relation. Describe its equivalence classes.

We must prove that $R$ is reflexive, symmetric and transitive.
The relation $R$ is reflexive for the following reason. If $x \in \mathbb{Z}$, then $3x-5x = -2x$ is even. But then since $3x-5x$ is even, we have $xRx$. Thus $R$ is reflexive.

To see that $R$ is symmetric, suppose $xRy$. We must show $yRx$. Since $xRy$, we know $3x-5y$ is even, so $3x-5y = 2a$ for some integer $a$. Now reason as follows:

\[
\begin{aligned}
3x-5y &= 2a \\
3x-5y+8y-8x &= 2a+8y-8x \\
3y-5x &= 2(a+4y-4x).
\end{aligned}
\]

From this it follows that $3y-5x$ is even, so $yRx$. We've now shown $xRy$ implies $yRx$, so $R$ is symmetric.

To prove that $R$ is transitive, assume that $xRy$ and $yRz$. (We will show that this implies $xRz$.) Since $xRy$ and $yRz$, it follows that $3x-5y$ and $3y-5z$ are both even, so $3x-5y = 2a$ and $3y-5z = 2b$ for some integers $a$ and $b$. Adding these equations, we get $(3x-5y)+(3y-5z) = 2a+2b$, and this simplifies to $3x-5z = 2(a+b+y)$. Therefore $3x-5z$ is even, so $xRz$. We've now shown that if $xRy$ and $yRz$, then $xRz$, so $R$ is transitive.

We've shown $R$ is reflexive, symmetric and transitive, so it's an equivalence relation.

This completes the first part of the problem. Now we move on to the second part. To find the equivalence classes, first note that

$[0] = \{x \in \mathbb{Z} : xR0\} = \{x \in \mathbb{Z} : 3x-5\cdot 0 \text{ is even}\} = \{x \in \mathbb{Z} : 3x \text{ is even}\} = \{x \in \mathbb{Z} : x \text{ is even}\}$.

Thus the equivalence class $[0]$ consists of all even integers. Next, note that

$[1] = \{x \in \mathbb{Z} : xR1\} = \{x \in \mathbb{Z} : 3x-5\cdot 1 \text{ is even}\} = \{x \in \mathbb{Z} : 3x-5 \text{ is even}\} = \{x \in \mathbb{Z} : x \text{ is odd}\}$.

Thus the equivalence class $[1]$ consists of all odd integers.

Consequently there are just two equivalence classes $\{\ldots,-4,-2,0,2,4,\ldots\}$ and $\{\ldots,-3,-1,1,3,5,\ldots\}$.
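
To see the two classes emerge concretely, one can group a finite window of integers by the relation. The sketch below is illustrative only: the window from -4 to 5 is an arbitrary choice, and `related` and `equivalence_classes` are hypothetical helper names. Grouping against a single representative per class is valid only because R has already been shown to be an equivalence relation.

```python
def related(x, y):
    """xRy if and only if 3x - 5y is even."""
    return (3 * x - 5 * y) % 2 == 0


def equivalence_classes(elements, related):
    """Group elements into classes, comparing each with one representative."""
    classes = []
    for x in elements:
        for cls in classes:
            if related(x, cls[0]):
                cls.append(x)
                break
        else:
            classes.append([x])  # x starts a new class
    return classes


window = range(-4, 6)  # a finite sample of the integers
classes = equivalence_classes(window, related)
print(classes)  # [[-4, -2, 0, 2, 4], [-3, -1, 1, 3, 5]]
```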


9. Define a relation $R$ on $\mathbb{Z}$ as $xRy$ if and only if $4 \mid (x+3y)$. Prove $R$ is an equivalence relation. Describe its equivalence classes.

This is reflexive, because for any $x \in \mathbb{Z}$ we have $4 \mid (x+3x)$, so $xRx$.

To prove that $R$ is symmetric, suppose $xRy$. Then $4 \mid (x+3y)$, so $x+3y = 4a$ for some integer $a$. Multiplying by 3, we get $3x+9y = 12a$, which becomes $y+3x = 12a-8y$. Then $y+3x = 4(3a-2y)$, so $4 \mid (y+3x)$, hence $yRx$. Thus we've shown $xRy$ implies $yRx$, so $R$ is symmetric.

To prove transitivity, suppose $xRy$ and $yRz$. Then $4 \mid (x+3y)$ and $4 \mid (y+3z)$, so $x+3y = 4a$ and $y+3z = 4b$ for some integers $a$ and $b$. Adding these two equations produces $x+4y+3z = 4a+4b$, or $x+3z = 4a+4b-4y = 4(a+b-y)$. Consequently $4 \mid (x+3z)$, so $xRz$, and $R$ is transitive.

As $R$ is reflexive, symmetric and transitive, it is an equivalence relation.

Now let's compute its equivalence classes.

$[0] = \{x \in \mathbb{Z} : xR0\} = \{x \in \mathbb{Z} : 4 \mid (x+3\cdot 0)\} = \{x \in \mathbb{Z} : 4 \mid x\} = \{\ldots,-4,0,4,8,12,16,\ldots\}$
$[1] = \{x \in \mathbb{Z} : xR1\} = \{x \in \mathbb{Z} : 4 \mid (x+3\cdot 1)\} = \{x \in \mathbb{Z} : 4 \mid (x+3)\} = \{\ldots,-3,1,5,9,13,17,\ldots\}$
$[2] = \{x \in \mathbb{Z} : xR2\} = \{x \in \mathbb{Z} : 4 \mid (x+3\cdot 2)\} = \{x \in \mathbb{Z} : 4 \mid (x+6)\} = \{\ldots,-2,2,6,10,14,18,\ldots\}$
$[3] = \{x \in \mathbb{Z} : xR3\} = \{x \in \mathbb{Z} : 4 \mid (x+3\cdot 3)\} = \{x \in \mathbb{Z} : 4 \mid (x+9)\} = \{\ldots,-1,3,7,11,15,19,\ldots\}$

11. Prove or disprove: If $R$ is an equivalence relation on an infinite set $A$, then $R$ has infinitely many equivalence classes.

This is False. Counterexample: consider the relation of congruence modulo 2. It is a relation on the infinite set $\mathbb{Z}$, but it has only two equivalence classes.

13. Answer: $m|A|$    15. Answer: 15


Section 11.4 


1. List all the partitions of the set $A = \{a,b\}$. Compare your answer to the answer to Exercise 5 of Section 11.3.

There are just two partitions $\{\{a\},\{b\}\}$ and $\{\{a,b\}\}$. These correspond to the two equivalence relations $R_1 = \{(a,a),(b,b)\}$ and $R_2 = \{(a,a),(a,b),(b,a),(b,b)\}$, respectively, on $A$.

3. Describe the partition of $\mathbb{Z}$ resulting from the equivalence relation $\equiv \pmod 4$.

Answer: The partition is $\{[0],[1],[2],[3]\} = \bigl\{\{\ldots,-4,0,4,8,12,\ldots\}, \{\ldots,-3,1,5,9,13,\ldots\}, \{\ldots,-2,2,6,10,14,\ldots\}, \{\ldots,-1,3,7,11,15,\ldots\}\bigr\}$

5. Answer: Congruence modulo 2, or "same parity."


Section 11.5 


1. Write the addition and multiplication tables for $\mathbb{Z}_2$.

  +  | [0] [1]          ·  | [0] [1]
 ----+---------        ----+---------
 [0] | [0] [1]         [0] | [0] [0]
 [1] | [1] [0]         [1] | [0] [1]

3. Write the addition and multiplication tables for $\mathbb{Z}_4$.

  +  | [0] [1] [2] [3]          ·  | [0] [1] [2] [3]
 ----+-----------------        ----+-----------------
 [0] | [0] [1] [2] [3]         [0] | [0] [0] [0] [0]
 [1] | [1] [2] [3] [0]         [1] | [0] [1] [2] [3]
 [2] | [2] [3] [0] [1]         [2] | [0] [2] [0] [2]
 [3] | [3] [0] [1] [2]         [3] | [0] [3] [2] [1]

5. Suppose $[a],[b] \in \mathbb{Z}_5$ and $[a]\cdot[b] = [0]$. Is it necessarily true that either $[a] = [0]$ or $[b] = [0]$?

The multiplication table for $\mathbb{Z}_5$ is shown in Section 11.5. In the body of that table, the only place that $[0]$ occurs is in the first row or the first column. That row and column are both headed by $[0]$. It follows that if $[a]\cdot[b] = [0]$, then either $[a]$ or $[b]$ must be $[0]$.

7. Do the following calculations in $\mathbb{Z}_9$, in each case expressing your answer as $[a]$ with $0 \le a \le 8$.
(a) $[8]+[8] = [7]$   (b) $[24]+[11] = [8]$   (c) $[21]\cdot[15] = [0]$   (d) $[8]\cdot[8] = [1]$
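
The tables in Exercises 1 and 3 and the calculations in Exercise 7 all reduce to ordinary integer arithmetic followed by taking a remainder. The sketch below shows one way to generate such tables; it is an illustration only, representing the class [a] by the integer a, and `tables` and `show` are hypothetical helper names.

```python
def tables(n):
    """Return the addition and multiplication tables of Z_n as nested lists."""
    residues = range(n)
    addition = [[(a + b) % n for b in residues] for a in residues]
    multiplication = [[(a * b) % n for b in residues] for a in residues]
    return addition, multiplication


def show(table, symbol):
    """Print a table with bracketed class labels, one row per line."""
    n = len(table)
    print(f"{symbol:>3} | " + " ".join(f"[{b}]" for b in range(n)))
    for a, row in enumerate(table):
        print(f"[{a}] | " + " ".join(f"[{value}]" for value in row))


add4, mul4 = tables(4)
show(add4, "+")
show(mul4, "*")

# The four calculations of Exercise 7, carried out modulo 9.
z9_answers = [(8 + 8) % 9, (24 + 11) % 9, (21 * 15) % 9, (8 * 8) % 9]
print(z9_answers)  # [7, 8, 0, 1]
```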


Chapter 12 


Section 12.1 


1. Suppose $A = \{0,1,2,3,4\}$, $B = \{2,3,4,5\}$ and $f = \{(0,3),(1,3),(2,4),(3,2),(4,2)\}$. State the domain and range of $f$. Find $f(2)$ and $f(1)$.
Domain is $A$; Range is $\{2,3,4\}$; $f(2) = 4$; $f(1) = 3$.

3. There are four different functions $f : \{a,b\} \to \{0,1\}$. List them all.

$f_1 = \{(a,0),(b,0)\}$, $f_2 = \{(a,1),(b,0)\}$, $f_3 = \{(a,0),(b,1)\}$, $f_4 = \{(a,1),(b,1)\}$

5. Give an example of a relation from $\{a,b,c,d\}$ to $\{d,e\}$ that is not a function.

One example is $\{(a,d),(a,e),(b,d),(c,d),(d,d)\}$.

7. Consider the set $f = \{(x,y) \in \mathbb{Z}\times\mathbb{Z} : 3x+y = 4\}$. Is this a function from $\mathbb{Z}$ to $\mathbb{Z}$? Explain.
Yes, since $3x+y = 4$ if and only if $y = 4-3x$, this is the function $f : \mathbb{Z} \to \mathbb{Z}$ defined as $f(x) = 4-3x$.

9. Consider the set $f = \{(x^2,x) : x \in \mathbb{R}\}$. Is this a function from $\mathbb{R}$ to $\mathbb{R}$? Explain.

No. This is not a function. Observe that $f$ contains the ordered pairs $(4,2)$ and $(4,-2)$. Thus the real number 4 occurs as the first coordinate of more than one element of $f$.

11. Is the set $\theta = \{(X,|X|) : X \subseteq \mathbb{Z}_5\}$ a function? If so, what is its domain and range?
Yes, this is a function. The domain is $\mathscr{P}(\mathbb{Z}_5)$. The range is $\{0,1,2,3,4,5\}$.




Section 12.2 


1. Let $A = \{1,2,3,4\}$ and $B = \{a,b,c\}$. Give an example of a function $f : A \to B$ that is neither injective nor surjective.

Consider $f = \{(1,a),(2,a),(3,a),(4,a)\}$. Then $f$ is not injective because $f(1) = f(2)$. Also $f$ is not surjective because it sends no element of $A$ to the element $c \in B$.


3. Consider the cosine function $\cos : \mathbb{R} \to \mathbb{R}$. Decide whether this function is injective and whether it is surjective. What if it had been defined as $\cos : \mathbb{R} \to [-1,1]$?

The function $\cos : \mathbb{R} \to \mathbb{R}$ is not injective because, for example, $\cos(0) = \cos(2\pi)$. It is not surjective because if $b = 5 \in \mathbb{R}$ (for example), there is no real number $x$ for which $\cos(x) = b$. The function $\cos : \mathbb{R} \to [-1,1]$ is surjective but not injective.


5. A function $f : \mathbb{Z} \to \mathbb{Z}$ is defined as $f(n) = 2n+1$. Verify whether this function is injective and whether it is surjective.

This function is injective. To see this, suppose $m,n \in \mathbb{Z}$ and $f(m) = f(n)$.

This means $2m+1 = 2n+1$, from which we get $2m = 2n$, and then $m = n$.

Thus $f$ is injective.

This function is not surjective. To see this notice that $f(n)$ is odd for all $n \in \mathbb{Z}$. So given the (even) number 2 in the codomain $\mathbb{Z}$, there is no $n$ with $f(n) = 2$.


7. A function $f : \mathbb{Z}\times\mathbb{Z} \to \mathbb{Z}$ is defined as $f((m,n)) = 2n-4m$. Verify whether this function is injective and whether it is surjective.

This is not injective because $(0,2) \ne (-1,0)$, yet $f((0,2)) = f((-1,0)) = 4$. This is not surjective because $f((m,n)) = 2n-4m = 2(n-2m)$ is always even. If $b \in \mathbb{Z}$ is odd, then $f((m,n)) \ne b$, for all $(m,n) \in \mathbb{Z}\times\mathbb{Z}$.


9. Prove that the function $f : \mathbb{R}-\{2\} \to \mathbb{R}-\{5\}$ defined by $f(x) = \frac{5x+1}{x-2}$ is bijective.

Proof. First, let's check that $f$ is injective. Suppose $f(x) = f(y)$. Then

\[
\begin{aligned}
\frac{5x+1}{x-2} &= \frac{5y+1}{y-2} \\
(5x+1)(y-2) &= (5y+1)(x-2) \\
5xy-10x+y-2 &= 5yx-10y+x-2 \\
-10x+y &= -10y+x \\
11y &= 11x \\
y &= x.
\end{aligned}
\]

Since $f(x) = f(y)$ implies $x = y$, it follows that $f$ is injective.
Next we check that $f$ is surjective. Take an arbitrary element $b \in \mathbb{R}-\{5\}$. We seek an $x \in \mathbb{R}-\{2\}$ for which $f(x) = b$, or $\frac{5x+1}{x-2} = b$. Solving this for $x$, we get:

\[
\begin{aligned}
5x+1 &= b(x-2) \\
5x+1 &= bx-2b \\
5x-xb &= -2b-1 \\
x(5-b) &= -2b-1.
\end{aligned}
\]

Since we have assumed $b \in \mathbb{R}-\{5\}$, the term $(5-b)$ is not zero, and we can divide with impunity to get $x = \frac{-2b-1}{5-b}$. This is an $x$ for which $f(x) = b$, so $f$ is surjective.
Since $f$ is both injective and surjective, it is bijective. ∎
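
The surjectivity argument produces an explicit preimage x = (-2b - 1)/(5 - b) for each b other than 5. The sketch below checks, with exact rational arithmetic, that this formula really inverts f on a sample of values. It is an illustration only: the sample grid of thirds is an arbitrary choice and `f_inverse` is a hypothetical name for the preimage formula.

```python
from fractions import Fraction


def f(x):
    """f(x) = (5x + 1)/(x - 2), defined for x != 2."""
    return (5 * x + 1) / (x - 2)


def f_inverse(b):
    """Preimage from the surjectivity argument: x = (-2b - 1)/(5 - b), b != 5."""
    return (-2 * b - 1) / (5 - b)


# Exact rational sample points k/3 for k = -30, ..., 30.
samples = [Fraction(k, 3) for k in range(-30, 31)]

# f(f_inverse(b)) == b for every sampled b in the codomain R - {5}.
codomain_round_trip = all(f(f_inverse(b)) == b for b in samples if b != 5)

# f_inverse(f(x)) == x for every sampled x in the domain R - {2}.
domain_round_trip = all(f_inverse(f(x)) == x for x in samples if x != 2)

print(codomain_round_trip, domain_round_trip)
```

Using `Fraction` rather than floating point keeps the comparisons exact, so a failed check would indicate an algebra error rather than rounding.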


11. Consider the function $\theta : \{0,1\}\times\mathbb{N} \to \mathbb{Z}$ defined as $\theta(a,b) = (-1)^a b$. Is $\theta$ injective? Is it surjective? Explain.

First we show that $\theta$ is injective. Suppose $\theta(a,b) = \theta(c,d)$. Then $(-1)^a b = (-1)^c d$. As $b$ and $d$ are both in $\mathbb{N}$, they are both positive. Then because $(-1)^a b = (-1)^c d$, it follows that $(-1)^a$ and $(-1)^c$ have the same sign. Since each of $(-1)^a$ and $(-1)^c$ equals $\pm 1$, we have $(-1)^a = (-1)^c$, so then $(-1)^a b = (-1)^c d$ implies $b = d$. But also $(-1)^a = (-1)^c$ means $a$ and $c$ have the same parity, and because $a,c \in \{0,1\}$, it follows $a = c$. Thus $(a,b) = (c,d)$, so $\theta$ is injective.

Next note that $\theta$ is not surjective because $\theta(a,b) = (-1)^a b$ is either positive or negative, but never zero. Therefore there exists no element $(a,b) \in \{0,1\}\times\mathbb{N}$ for which $\theta(a,b) = 0 \in \mathbb{Z}$.

13. Consider the function $f : \mathbb{R}^2 \to \mathbb{R}^2$ defined by the formula $f(x,y) = (xy,x^3)$. Is $f$ injective? Is it surjective?

Notice that $f(0,1) = (0,0)$ and $f(0,0) = (0,0)$, so $f$ is not injective. To show that $f$ is also not surjective, we will show that it's impossible to find an ordered pair $(x,y)$ with $f(x,y) = (1,0)$. If there were such a pair, then $f(x,y) = (xy,x^3) = (1,0)$, which yields $xy = 1$ and $x^3 = 0$. From $x^3 = 0$ we get $x = 0$, so $xy = 0$, a contradiction.

15. This question concerns functions $f : \{A,B,C,D,E,F,G\} \to \{1,2,3,4,5,6,7\}$. How many such functions are there? How many of these functions are injective? How many are surjective? How many are bijective?

Function $f$ can be described as a list $(f(A), f(B), f(C), f(D), f(E), f(F), f(G))$, where there are seven choices for each entry. By the multiplication principle, the total number of functions $f$ is $7^7 = 823543$.

If $f$ is injective, then this list can't have any repetition, so there are $7! = 5040$ injective functions. Since any injective function sends the seven elements of the domain to seven distinct elements of the codomain, all of the injective functions are surjective, and vice versa. Thus there are 5040 surjective functions and 5040 bijective functions.

17. This question concerns functions $f : \{A,B,C,D,E,F,G\} \to \{1,2\}$. How many such functions are there? How many of these functions are injective? How many are surjective? How many are bijective?

Function $f$ can be described as a list $(f(A), f(B), f(C), f(D), f(E), f(F), f(G))$, where there are two choices for each entry. Therefore the total number of functions is $2^7 = 128$. It is impossible for any function to send all seven elements of $\{A,B,C,D,E,F,G\}$ to seven distinct elements of $\{1,2\}$, so none of these 128 functions is injective, hence none are bijective.

How many are surjective? Only two of the 128 functions are not surjective, and they are the "constant" functions $\{(A,1),(B,1),(C,1),(D,1),(E,1),(F,1),(G,1)\}$ and $\{(A,2),(B,2),(C,2),(D,2),(E,2),(F,2),(G,2)\}$. So there are 126 surjective functions.
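
Because a function on a seven-element domain is just a list of seven values, the counts in Exercises 15 and 17 can be confirmed by enumeration. The following sketch is illustrative only; `classify` is a hypothetical helper that enumerates every value list, and the larger case of Exercise 15 is checked through permutations rather than all 823543 lists to keep the run short.

```python
from itertools import permutations, product
from math import factorial


def classify(domain_size, codomain):
    """Count all, injective, and surjective functions into `codomain`."""
    total = injective = surjective = 0
    for values in product(codomain, repeat=domain_size):
        total += 1
        injective += len(set(values)) == domain_size
        surjective += set(values) == set(codomain)
    return total, injective, surjective


# Exercise 17: functions from a 7-element set to {1, 2}.
counts_into_two = classify(7, [1, 2])
print(counts_into_two)  # (128, 0, 126)

# Exercise 15: 7^7 functions in total; the injective ones are exactly the
# repetition-free lists, i.e. the permutations of {1, ..., 7}.
total_into_seven = 7 ** 7
bijections = sum(1 for _ in permutations(range(1, 8)))
print(total_into_seven, bijections)  # 823543 5040
```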




Section 12.3 


1. For any six integers, at least two have the same remainder when divided by 5.

Proof. Let $A$ be a set of six integers and $B = \{0,1,2,3,4\}$. Define $f : A \to B$ so that $f(x)$ is the remainder when $x$ is divided by 5. Because $|A| = 6 > 5 = |B|$, the pigeonhole principle guarantees that $f$ is not injective. Thus there are two integers $x,y \in A$ for which $f(x) = f(y)$, meaning $x$ and $y$ have the same remainder when divided by 5. ∎


3. For any six integers, 9 divides the sum or difference of two of them.

Proof. Let $A$ be a set of six integers. Let $B = \{\{0\},\{1,8\},\{2,7\},\{3,6\},\{4,5\}\}$. Notice that every element of $B$ is a set that either has one element (0) or has two elements whose sum is 9. Define $f : A \to B$ so that $f(x)$ is the set in $B$ that contains the remainder when $x$ is divided by 9. For example, $f(12) = \{3,6\}$ and $f(18) = \{0\}$. Since $6 = |A| > |B| = 5$, the pigeonhole principle implies that $f$ is not injective. Thus there exist $x,y \in A$ for which $f(x) = f(y)$. Then either $x$ and $y$ both have the same remainder $r$ when divided by 9, or the remainders $r$ and $s$ add to 9. In the first case $x = 9m+r$ and $y = 9n+r$ (for some $m,n \in \mathbb{Z}$), so 9 divides $x-y = 9(m-n)$. In the second case $x = 9m+r$ and $y = 9n+s$, so 9 divides $x+y = 9m+9n+r+s = 9(m+n+1)$. ∎


5. Any set of 7 integers contains a pair whose sum or difference is divisible by 10. 


Proof. Let A be any set of 7 integers. Let B = {{1,9}, {2,8}, {3,7}, {4,6}, {5}, {0}}. So B
is a set of six sets; in particular, |B| = 6. The four 2-element sets in B contain
numbers that add to 10. Let f : A → B be the function for which f(x) equals the set
in B that contains the remainder when x is divided by 10. (Examples: f(97) = {3,7},
f(13) = {3,7}, f(12) = {2,8}, f(230) = {0}, f(15) = {5}, etc.) Because |A| > |B|, the
pigeonhole principle guarantees that f is not injective. Select two integers
x, y ∈ A for which f(x) = f(y). If x and y happen to have the same remainder when
divided by 10, then their difference x − y is divisible by 10. If x and y don't
have the same remainder when divided by 10, then f(x) = f(y) = {r,s} is one of the
2-element sets in B. In this case, x = 10k + r and y = 10ℓ + s (for integers k and ℓ),
while r + s = 10. Then the sum x + y = (10k + r) + (10ℓ + s) = 10k + 10ℓ + 10 = 10(k + ℓ + 1)
is divisible by 10. ∎
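The two pigeonhole claims about sums and differences (six integers with 9, seven
integers with 10) can be sanity-checked by brute force. The following is only a
sketch and not part of the original solutions; the names has_pair and
always_has_pair are hypothetical. Because only remainders matter, it tests every
multiset of residues instead of actual integers.

```python
from itertools import combinations, combinations_with_replacement


def has_pair(residues, modulus):
    """True if some two entries have a sum or difference divisible by modulus."""
    return any((a + b) % modulus == 0 or (a - b) % modulus == 0
               for a, b in combinations(residues, 2))


def always_has_pair(count, modulus):
    """Check every multiset of `count` residues modulo `modulus`."""
    return all(has_pair(r, modulus)
               for r in combinations_with_replacement(range(modulus), count))


if __name__ == "__main__":
    print(always_has_pair(6, 9), always_has_pair(7, 10))
```

The same helper also suggests the bounds are tight: with one fewer integer, the
residues 0, 1, 2, 3, 4 (for 9) and 0, 1, 2, 3, 4, 5 (for 10) contain no such pair.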


7. If X ⊆ {1,2,3,...,2n} and |X| > n, then one element of X divides another.


Proof. Say X is as stated, and let Y = {1,3,5,...,2n−1} be the set of positive odd
integers less than 2n; note |Y| = n. Any positive integer m can be factored as
m = 2^p·q where q is the largest odd integer dividing m. For example, 100 = 2²·25,
12 = 2²·3, 8 = 2³·1 and 13 = 2⁰·13. Let f : X → Y be such that f(m) equals the largest
odd integer dividing m. For example, f(100) = 25, f(12) = 3, f(8) = 1 and f(13) = 13.
Because |X| > |Y|, the pigeonhole principle guarantees f is not injective. Thus
there are two integers a, b ∈ X for which f(a) = f(b). Say f(a) = q. Then a = 2^p·q
and b = 2^r·q for some p and r. If p < r then a | b. If r < p, then b | a. ∎
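The argument is constructive: two elements with the same largest odd divisor
form the dividing pair. A minimal sketch of that search follows, assuming the
hypothetical helper names odd_part and dividing_pair; it is an illustration, not
the author's method.

```python
from itertools import combinations


def odd_part(m):
    """Largest odd integer dividing the positive integer m."""
    while m % 2 == 0:
        m //= 2
    return m


def dividing_pair(xs):
    """Return (a, b) from xs with a < b, a | b and equal odd parts, or None."""
    seen = {}  # odd part -> smallest element seen with that odd part
    for x in sorted(xs):
        q = odd_part(x)
        if q in seen:
            # seen[q] = 2^p * q and x = 2^r * q with p < r, so seen[q] divides x.
            return seen[q], x
        seen[q] = x
    return None


if __name__ == "__main__":
    print(dividing_pair({3, 4, 5, 6, 7}))
```

Scanning in increasing order means the earlier element always carries the smaller
power of 2, so the returned pair is already ordered as (divisor, multiple).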




Section 12.4 


1. Suppose A = {5,6,8}, B = {0,1}, C = {1,2,3}. Let f : A → B be the function f =
{(5,1), (6,0), (8,1)}, and g : B → C be g = {(0,1), (1,1)}. Find g∘f.
g∘f = {(5,1), (6,1), (8,1)}


3. Suppose A = {1,2,3}. Let f : A → A be the function f = {(1,2), (2,2), (3,1)}, and let
g : A → A be the function g = {(1,3), (2,1), (3,2)}. Find g∘f and f∘g.
g∘f = {(1,1), (2,1), (3,3)}; f∘g = {(1,1), (2,2), (3,2)}.


5. If f, g : ℝ → ℝ are defined as f(x) = ∛(x + 1) and g(x) = x³, find formulas for g∘f and
f∘g. Answers: g∘f(x) = x + 1; f∘g(x) = ∛(x³ + 1).


7. Consider the functions f, g : ℤ×ℤ → ℤ×ℤ defined as f(m,n) = (mn, m²) and
g(m,n) = (m+1, m+n). Find the formulas for g∘f and f∘g.
Note g∘f(m,n) = g(f(m,n)) = g(mn, m²) = (mn + 1, mn + m²).
Thus g∘f(m,n) = (mn + 1, mn + m²).
Note f∘g(m,n) = f(g(m,n)) = f(m+1, m+n) = ((m+1)(m+n), (m+1)²).
Thus f∘g(m,n) = (m² + mn + m + n, m² + 2m + 1).


9. Consider the functions f : ℤ×ℤ → ℤ defined as f(m,n) = m + n and g : ℤ → ℤ×ℤ
defined as g(m) = (m,m). Find the formulas for g∘f and f∘g.
g∘f(m,n) = (m+n, m+n);  f∘g(m) = 2m
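Compositions of finite functions like those in this section can be computed
mechanically when each function is stored as a dictionary of ordered pairs. The
sketch below assumes that representation; compose and the variable names are
hypothetical and chosen only for illustration.

```python
def compose(g, f):
    """Return g∘f (apply f first, then g) for finite functions stored as dicts."""
    return {x: g[f[x]] for x in f}


# f = {(5,1),(6,0),(8,1)} and g = {(0,1),(1,1)}
f1 = {5: 1, 6: 0, 8: 1}
g1 = {0: 1, 1: 1}

# f = {(1,2),(2,2),(3,1)} and g = {(1,3),(2,1),(3,2)}
f3 = {1: 2, 2: 2, 3: 1}
g3 = {1: 3, 2: 1, 3: 2}


# f(m,n) = (mn, m^2) and g(m,n) = (m+1, m+n) on pairs of integers
def f7(m, n):
    return (m * n, m * m)


def g7(m, n):
    return (m + 1, m + n)


if __name__ == "__main__":
    print(compose(g1, f1))
    print(compose(g3, f3), compose(f3, g3))
    print(g7(*f7(2, 3)), f7(*g7(2, 3)))
```

Note the argument order: compose(g, f) mirrors the notation g∘f, so the
right-hand function is applied first.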


Section 12.5 


1. Check that f : ℤ → ℤ defined by f(n) = 6 − n is bijective. Then compute f⁻¹.

It is injective: Suppose f(m) = f(n). Then 6 − m = 6 − n, which reduces to m = n.
It is surjective as follows. If b ∈ ℤ, then f(6 − b) = 6 − (6 − b) = b.

Inverse: f⁻¹(n) = 6 − n.


3. Let B = {2ⁿ : n ∈ ℤ} = {..., 1/4, 1/2, 1, 2, 4, 8, ...}. Show that the function f : ℤ → B defined
as f(n) = 2ⁿ is bijective. Then find f⁻¹.

It is injective: Suppose f(m) = f(n), which means 2ᵐ = 2ⁿ. Taking logs of both
sides gives log₂(2ᵐ) = log₂(2ⁿ), which simplifies to m = n.

The function f is surjective as follows. Suppose b ∈ B. By definition of B this
means b = 2ⁿ for some n ∈ ℤ. Then f(n) = 2ⁿ = b.

Inverse: f⁻¹(n) = log₂(n).


5. The function f : ℝ → ℝ defined as f(x) = 2x − e is bijective. Find its inverse.

Inverse: f⁻¹(x) = (x + e)/2.


7. Show that f : ℝ² → ℝ² where f(x, y) = ((x² + 1)y, x³) is bijective. Find its inverse.

First we prove the function is injective. Assume f(x₁, y₁) = f(x₂, y₂). Then
(x₁² + 1)y₁ = (x₂² + 1)y₂ and x₁³ = x₂³. Since the real-valued function x ↦ x³ is one-to-one,
it follows that x₁ = x₂. Since x₁ = x₂, and x₁² + 1 > 0, we may divide both sides of
(x₁² + 1)y₁ = (x₁² + 1)y₂ by (x₁² + 1) to get y₁ = y₂. Hence (x₁, y₁) = (x₂, y₂).

Now we prove f is surjective. Let (a, b) ∈ ℝ². Set x = b^(1/3) and y = a/(b^(2/3) + 1). Then
f(x, y) = ((b^(2/3) + 1)·a/(b^(2/3) + 1), (b^(1/3))³) = (a, b). It now follows that f is bijective.

Finally, we compute the inverse. Write f(x, y) = (u, v). Interchange variables to
get (x, y) = f(u, v) = ((u² + 1)v, u³). Thus x = (u² + 1)v and y = u³. Hence u = y^(1/3) and
v = x/(y^(2/3) + 1). Therefore f⁻¹(x, y) = (u, v) = (y^(1/3), x/(y^(2/3) + 1)).


9. Consider the function f : ℝ×ℕ → ℕ×ℝ defined as f(x, y) = (y, 3xy). Check that this
is bijective; find its inverse.

To see that this is injective, suppose f(a, b) = f(c, d). This means (b, 3ab) = (d, 3cd).
Since the first coordinates must be equal, we get b = d. As the second coordinates
are equal, we get 3ab = 3cd, which becomes 3ab = 3cb. Note that, from the
definition of f, b ∈ ℕ, so b ≠ 0. Thus we can divide both sides of 3ab = 3cb by the
non-zero quantity 3b to get a = c. Now we have a = c and b = d, so (a, b) = (c, d). It
follows that f is injective.

Next we check that f is surjective. Given any (b, c) in the codomain ℕ×ℝ, notice
that (c/(3b), b) belongs to the domain ℝ×ℕ, and f(c/(3b), b) = (b, c). Thus f is surjective.
As it is both injective and surjective, it is bijective; thus the inverse exists.

To find the inverse, recall that we obtained f(c/(3b), b) = (b, c). Then f⁻¹(f(c/(3b), b)) =
f⁻¹(b, c), which reduces to (c/(3b), b) = f⁻¹(b, c). Replacing b and c with x and y,
respectively, we get f⁻¹(x, y) = (y/(3x), x).
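A claimed inverse is easy to spot-check: composing in both orders should return
the input. The sketch below does this for f(x, y) = (y, 3xy) and the candidate
inverse (x, y) ↦ (y/(3x), x), using exact rational arithmetic as a stand-in for
real numbers. The names f and f_inv are illustrative only.

```python
from fractions import Fraction


def f(x, y):
    """f(x, y) = (y, 3xy), with x rational (standing in for a real) and y a natural number."""
    return (y, 3 * x * y)


def f_inv(x, y):
    """Candidate inverse: (x, y) -> (y / (3x), x), with x a natural number."""
    return (Fraction(y) / (3 * x), x)


if __name__ == "__main__":
    point = (Fraction(5, 2), 4)
    print(f(*point), f_inv(*f(*point)))
```

A finite spot-check like this cannot replace the proof, but it catches slips such
as swapping the two coordinates of the inverse.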


Section 12.6 


1. Consider the function f : ℝ → ℝ defined as f(x) = x² + 3. Find f([−3,5]) and
f⁻¹([12,19]). Answers: f([−3,5]) = [3,28]; f⁻¹([12,19]) = [−4,−3] ∪ [3,4].


3. This problem concerns functions f : {1,2,3,4,5,6,7} → {0,1,2,3,4}. How many such
functions have the property that |f⁻¹({3})| = 3? Answer: 4⁴·C(7,3), where C(7,3) is the
binomial coefficient "7 choose 3".
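The count can be confirmed by exhaustive enumeration, since there are only
5⁷ = 78125 functions from a 7-element set to a 5-element set. The sketch below
(with the hypothetical name count_functions) represents each function as the
tuple of its values and counts those taking the value 3 exactly three times.

```python
from itertools import product
from math import comb


def count_functions(domain_size=7, codomain=(0, 1, 2, 3, 4), target=3, fiber_size=3):
    """Count value tuples in which `target` occurs exactly `fiber_size` times."""
    return sum(1 for values in product(codomain, repeat=domain_size)
               if values.count(target) == fiber_size)


if __name__ == "__main__":
    print(count_functions(), comb(7, 3) * 4 ** 4)
```

The formula reflects the two independent choices: which three inputs map to 3,
and where each of the remaining four inputs goes among the other four outputs.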


5. Consider a function f : A → B and a subset X ⊆ A. We observed in Section 12.6
that f⁻¹(f(X)) ≠ X in general. However X ⊆ f⁻¹(f(X)) is always true. Prove this.

Proof. Suppose a ∈ X. Thus f(a) ∈ {f(x) : x ∈ X} = f(X), that is, f(a) ∈ f(X). Now, by
definition of preimage, we have f⁻¹(f(X)) = {x ∈ A : f(x) ∈ f(X)}. Since a ∈ A and
f(a) ∈ f(X), it follows that a ∈ f⁻¹(f(X)). This proves X ⊆ f⁻¹(f(X)). ∎


7. Given a function f : A → B and subsets W, X ⊆ A, prove f(W ∩ X) ⊆ f(W) ∩ f(X).

Proof. Suppose b ∈ f(W ∩ X). This means b ∈ {f(x) : x ∈ W ∩ X}, that is, b = f(a)
for some a ∈ W ∩ X. Since a ∈ W we have b = f(a) ∈ {f(x) : x ∈ W} = f(W). Since
a ∈ X we have b = f(a) ∈ {f(x) : x ∈ X} = f(X). Thus b is in both f(W) and f(X), so
b ∈ f(W) ∩ f(X). This completes the proof that f(W ∩ X) ⊆ f(W) ∩ f(X). ∎


9. Given a function f : A → B and subsets W, X ⊆ A, prove f(W ∪ X) = f(W) ∪ f(X).

Proof. First we will show f(W ∪ X) ⊆ f(W) ∪ f(X). Suppose b ∈ f(W ∪ X). This
means b ∈ {f(x) : x ∈ W ∪ X}, that is, b = f(a) for some a ∈ W ∪ X. Thus a ∈ W or a ∈ X.
If a ∈ W, then b = f(a) ∈ {f(x) : x ∈ W} = f(W). If a ∈ X, then b = f(a) ∈ {f(x) : x ∈ X} =
f(X). Thus b is in f(W) or f(X), so b ∈ f(W) ∪ f(X). This completes the proof that
f(W ∪ X) ⊆ f(W) ∪ f(X).

Next we will show f(W) ∪ f(X) ⊆ f(W ∪ X). Suppose b ∈ f(W) ∪ f(X). This means
b ∈ f(W) or b ∈ f(X). If b ∈ f(W), then b = f(a) for some a ∈ W. If b ∈ f(X), then
b = f(a) for some a ∈ X. Either way, b = f(a) for some a that is in W or X. That
is, b = f(a) for some a ∈ W ∪ X. But this means b ∈ f(W ∪ X). This completes the
proof that f(W) ∪ f(X) ⊆ f(W ∪ X).

The previous two paragraphs show f(W ∪ X) = f(W) ∪ f(X). ∎


11. Given f : A → B and subsets Y, Z ⊆ B, prove f⁻¹(Y ∪ Z) = f⁻¹(Y) ∪ f⁻¹(Z).

Proof. First we will show f⁻¹(Y ∪ Z) ⊆ f⁻¹(Y) ∪ f⁻¹(Z). Suppose a ∈ f⁻¹(Y ∪ Z). By
Definition 12.9, this means f(a) ∈ Y ∪ Z. Thus, f(a) ∈ Y or f(a) ∈ Z. If f(a) ∈ Y,
then a ∈ f⁻¹(Y), by Definition 12.9. Similarly, if f(a) ∈ Z, then a ∈ f⁻¹(Z). Hence
a ∈ f⁻¹(Y) or a ∈ f⁻¹(Z), so a ∈ f⁻¹(Y) ∪ f⁻¹(Z). Consequently f⁻¹(Y ∪ Z) ⊆ f⁻¹(Y) ∪
f⁻¹(Z).

Next we show f⁻¹(Y) ∪ f⁻¹(Z) ⊆ f⁻¹(Y ∪ Z). Suppose a ∈ f⁻¹(Y) ∪ f⁻¹(Z). This
means a ∈ f⁻¹(Y) or a ∈ f⁻¹(Z). Hence, by Definition 12.9, f(a) ∈ Y or f(a) ∈ Z,
which means f(a) ∈ Y ∪ Z. But by Definition 12.9, f(a) ∈ Y ∪ Z means a ∈ f⁻¹(Y ∪ Z).
Consequently f⁻¹(Y) ∪ f⁻¹(Z) ⊆ f⁻¹(Y ∪ Z).

The previous two paragraphs show f⁻¹(Y ∪ Z) = f⁻¹(Y) ∪ f⁻¹(Z). ∎


13. Let f : A → B be a function, and X ⊆ A. Prove or disprove: f(f⁻¹(f(X))) = f(X).

Proof. First we will show f(f⁻¹(f(X))) ⊆ f(X). Suppose y ∈ f(f⁻¹(f(X))). By
definition of image, this means y = f(x) for some x ∈ f⁻¹(f(X)). But by definition
of preimage, x ∈ f⁻¹(f(X)) means f(x) ∈ f(X). Thus we have y = f(x) ∈ f(X), as
desired.

Next we show f(X) ⊆ f(f⁻¹(f(X))). Suppose y ∈ f(X). This means y = f(x) for
some x ∈ X. Then f(x) = y ∈ f(X), which means x ∈ f⁻¹(f(X)). Then by definition
of image, f(x) ∈ f(f⁻¹(f(X))). Now we have y = f(x) ∈ f(f⁻¹(f(X))), as desired.

The previous two paragraphs show f(f⁻¹(f(X))) = f(X). ∎
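The image and preimage identities proved in this section can be checked
exhaustively on small finite sets. The following sketch is an illustration under
stated assumptions, not part of the text: functions are stored as dictionaries,
and the helper names subsets, image, preimage and check_identities are
hypothetical.

```python
from itertools import combinations, product


def subsets(s):
    """All subsets of s, as frozensets."""
    s = list(s)
    return [frozenset(c) for k in range(len(s) + 1) for c in combinations(s, k)]


def image(f, X):
    return frozenset(f[x] for x in X)


def preimage(f, Y):
    return frozenset(x for x in f if f[x] in Y)


def check_identities(A, B):
    """Test the section's identities for every function A -> B and every subset."""
    for values in product(B, repeat=len(A)):
        f = dict(zip(A, values))
        for W in subsets(A):
            if not W <= preimage(f, image(f, W)):
                return False
            if image(f, preimage(f, image(f, W))) != image(f, W):
                return False
            for X in subsets(A):
                if not image(f, W & X) <= image(f, W) & image(f, X):
                    return False
                if image(f, W | X) != image(f, W) | image(f, X):
                    return False
        for Y in subsets(B):
            for Z in subsets(B):
                if preimage(f, Y | Z) != preimage(f, Y) | preimage(f, Z):
                    return False
    return True


if __name__ == "__main__":
    print(check_identities([1, 2, 3], [0, 1]))
```

The same helpers show why equality can fail for intersections: with the constant
function f = {1: 0, 2: 0}, the sets W = {1} and X = {2} have empty intersection
but identical images.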


Chapter 13 Exercises 


Section 13.2 


1. Prove that lim_{x→5} (8x − 3) = 37.

Proof. Take ε > 0. Note that |(8x − 3) − 37| = |8x − 40| = |8(x − 5)| = 8|x − 5|. So if δ = ε/8,
then 0 < |x − 5| < δ implies |(8x − 3) − 37| = 8|x − 5| < 8δ = 8(ε/8) = ε. By Definition 13.2,
lim_{x→5} (8x − 3) = 37. ∎


3. Prove that lim_{x→0} (x + 2) = 2.

Proof. Given ε > 0, let δ = ε. Then 0 < |x − 0| < δ implies |(x + 2) − 2| = |x − 0| < δ = ε.
By Definition 13.2, lim_{x→0} (x + 2) = 2. ∎




5. Prove that lim_{x→3} (x² − 2) = 7.

Proof. Suppose ε > 0. In what follows we will produce a corresponding δ for
which 0 < |x − 3| < δ implies |(x² − 2) − 7| < ε. Notice that

|(x² − 2) − 7| = |x² − 9| = |(x − 3)(x + 3)| = |x − 3|·|x + 3|.

If |x − 3| < 1, then |x + 3| = |(x − 3) + 6| ≤ |x − 3| + |6| < 1 + 6 = 7 (using the inequality (13.2)
from page 245). So if 0 < |x − 3| < 1, then |x + 3| < 7 and the above equation yields

|(x² − 2) − 7| = |x − 3|·|x + 3| < |x − 3|·7 = 7|x − 3|.

Take δ to be smaller than both 1 and ε/7. Then 0 < |x − 3| < δ implies |(x² − 2) − 7| <
7·|x − 3| < 7δ < 7(ε/7) = ε. By Definition 13.2, we have lim_{x→3} (x² − 2) = 7. ∎
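The choice of δ in an ε-δ proof can be tested numerically. The sketch below
samples points near 3 for the limit of x² − 2 and checks the required inequality.
It assumes the particular choice δ = ½·min(1, ε/7), which is one value "smaller
than both 1 and ε/7"; the names delta_for and check are hypothetical.

```python
def delta_for(eps):
    """One admissible delta: strictly smaller than both 1 and eps/7."""
    return 0.5 * min(1.0, eps / 7.0)


def check(eps, samples=1000):
    """Sample x with 0 < |x - 3| < delta and test |(x^2 - 2) - 7| < eps."""
    d = delta_for(eps)
    for k in range(1, samples):
        offset = d * k / samples  # strictly between 0 and d
        for x in (3 - offset, 3 + offset):
            if not abs((x * x - 2) - 7) < eps:
                return False
    return True


if __name__ == "__main__":
    print([check(eps) for eps in (10, 1, 0.1, 1e-3, 1e-6)])
```

Sampling only gives evidence, not a proof, but a wrong δ (say δ = ε) is quickly
exposed by the same test.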


Section 13.3 


1. Prove that lim_{x→0} log₁₀|x| does not exist.

Proof. Suppose for the sake of contradiction that lim_{x→0} log₁₀|x| = L, for some L ∈ ℝ.
Let ε = 1, so there is a δ > 0 for which 0 < |x − 0| < δ implies |log₁₀|x| − L| < 1.
Choose an x ≠ 0 for which |x| is smaller than both δ and 10^(L−1). Then 0 < |x − 0| <
δ, so |log₁₀|x| − L| < 1. But also |x| < 10^(L−1), so log₁₀|x| < L − 1. Consequently
log₁₀|x| − L < −1, and thus |log₁₀|x| − L| > 1. This is a contradiction. ∎


3. Prove that lim_{x→0} 1/x² does not exist.

Proof. Suppose for the sake of contradiction that lim_{x→0} 1/x² = L, for some L ∈ ℝ. Fix
an ε > 0 for which L + ε > 0. Choose a real number δ > 0 for which 0 < |x − 0| < δ
implies |1/x² − L| < ε. Choose an x > 0 that is smaller than both δ and √(1/(L + ε)). Then
0 < |x − 0| < δ, so |1/x² − L| < ε. But also, x < √(1/(L + ε)), so x² < 1/(L + ε) and hence 1/x² > L + ε.
Consequently 1/x² − L > ε, and thus |1/x² − L| > ε. This is a contradiction. ∎


5. Prove that lim_{x→0} x cot(1/x) does not exist.

Proof. Note that cot(x) = cos(x)/sin(x). Because sin(kπ) = 0 for any k ∈ ℤ, it follows that
cot(x) is undefined for any x = kπ. Hence x cot(1/x) is undefined for any x = 1/(kπ). Given
any δ > 0, there exist values of x = 1/(kπ) that satisfy 0 < |x − 0| < δ. The statement
(0 < |x − 0| < δ) ⇒ |x cot(1/x) − L| < ε is meaningless for such x, so the limit cannot
exist. (See the remark following Example 13.5 on page 250.) ∎


Section 13.4 
1. Given two or more functions f₁, f₂, ..., fₙ, suppose that lim_{x→c} fᵢ(x) exists for each
1 ≤ i ≤ n. Prove that lim_{x→c} (f₁(x) + f₂(x) + ··· + fₙ(x)) = lim_{x→c} f₁(x) + lim_{x→c} f₂(x) + ··· + lim_{x→c} fₙ(x).

Proof. The proof is by induction. For the basis case n = 2, the result follows
from the sum rule (Theorem 13.5).

Now let k ≥ 2 and assume that the theorem holds for k functions f₁, f₂, ..., fₖ. That
is, lim_{x→c} (f₁(x) + f₂(x) + ··· + fₖ(x)) = lim_{x→c} f₁(x) + lim_{x→c} f₂(x) + ··· + lim_{x→c} fₖ(x). We must show
lim_{x→c} (f₁(x) + f₂(x) + ··· + fₖ(x) + fₖ₊₁(x)) = lim_{x→c} f₁(x) + lim_{x→c} f₂(x) + ··· + lim_{x→c} fₖ(x) + lim_{x→c} fₖ₊₁(x).
Just note that

lim_{x→c} (f₁(x) + f₂(x) + ··· + fₖ(x) + fₖ₊₁(x))
  = lim_{x→c} ((f₁(x) + f₂(x) + ··· + fₖ(x)) + fₖ₊₁(x))                  (group)
  = lim_{x→c} (f₁(x) + f₂(x) + ··· + fₖ(x)) + lim_{x→c} fₖ₊₁(x)          (sum rule, Theorem 13.5)
  = lim_{x→c} f₁(x) + lim_{x→c} f₂(x) + ··· + lim_{x→c} fₖ(x) + lim_{x→c} fₖ₊₁(x)   (inductive hypothesis).

This completes the proof by induction. ∎


3. Use the previous two exercises and the constant multiple rule (Theorem 13.4) to
prove that if f(x) is a polynomial, then lim_{x→c} f(x) = f(c) for any c ∈ ℝ.

Proof. First note that by Exercise 2 and the identity function rule, we have
lim_{x→c} xⁿ = lim_{x→c} (x·x···x) = (lim_{x→c} x)·(lim_{x→c} x)···(lim_{x→c} x) = c·c···c = cⁿ. Thus lim_{x→c} xⁿ = cⁿ.

Now consider an arbitrary polynomial f(x) = a₀ + a₁x + a₂x² + ··· + aₙxⁿ, where
each aᵢ is a constant real number. Then

lim_{x→c} f(x) = lim_{x→c} (a₀ + a₁x + a₂x² + ··· + aₙxⁿ)
  = lim_{x→c} a₀ + lim_{x→c} a₁x + lim_{x→c} a₂x² + ··· + lim_{x→c} aₙxⁿ           (Exercise 1)
  = lim_{x→c} a₀ + a₁ lim_{x→c} x + a₂ lim_{x→c} x² + ··· + aₙ lim_{x→c} xⁿ       (constant multiple rule)
  = a₀ + a₁c + a₂c² + ··· + aₙcⁿ = f(c). ∎


5. Prove that if lim_{x→c} f(x) = L and lim_{x→c} f(x) = M, then L = M.

Proof. Suppose lim_{x→c} f(x) = L and lim_{x→c} f(x) = M. Then by limit laws, L − M =
lim_{x→c} f(x) − lim_{x→c} f(x) = lim_{x→c} (f(x) − f(x)) = lim_{x→c} 0 = 0. This shows L − M = 0, so L = M. ∎

Section 13.5 
1. Prove that the function f(x) = √x is continuous at any number c > 0. Deduce that
lim_{x→c} √(g(x)) = √(lim_{x→c} g(x)), provided lim_{x→c} g(x) exists and is greater than zero.

Proof. Suppose c > 0. Proving √x is continuous at c amounts to proving that
lim_{x→c} √x = √c. Here is a proof of this limit: For any ε > 0 let δ be smaller than both
c and ε√c. Now suppose 0 < |x − c| < δ. Because δ < c it follows that |x − c| < c, and
hence −c < x − c < c. From this, 0 < x, so √x exists. Also, because δ < ε√c, we have

|√x − √c| = |(√x − √c)·(√x + √c)/(√x + √c)| = |(x − c)/(√x + √c)| = |x − c|·1/(√x + √c)
          < |x − c|·(1/√c) < δ·(1/√c) < ε√c·(1/√c) = ε.

(Note: above we used the fact √x + √c > √c to get 1/(√x + √c) < 1/√c.) We have now
shown that 0 < |x − c| < δ implies |√x − √c| < ε, so lim_{x→c} √x = √c. This means √x is
continuous at any number c > 0, by Definition 13.3.
Applying Theorem 13.9, we get lim_{x→c} √(g(x)) = √(lim_{x→c} g(x)). ∎


Section 13.6 
1. If n ∈ ℕ, then lim_{x→∞} 1/xⁿ = 0.

Proof. Suppose ε > 0. Let N = (1/ε)^(1/n). If x > N, then xⁿ > Nⁿ = 1/ε, so 0 < 1/xⁿ < ε.
Thus |1/xⁿ − 0| = |1/xⁿ| < ε. In summary, x > N implies |1/xⁿ − 0| < ε, so lim_{x→∞} 1/xⁿ = 0 by
Definition 13.4. ∎
3. If a ∈ ℝ, then lim_{x→∞} a = a.

Proof. Suppose ε > 0. Let N = 1. Then x > N implies |a − a| < ε, which means
lim_{x→∞} a = a. (Note: The implication x > N ⇒ |a − a| < ε is actually true no matter
what value x has, because |a − a| < ε is automatically true.) ∎
5. If both lim_{x→∞} f(x) and lim_{x→∞} g(x) exist, then lim_{x→∞} (f(x) + g(x)) = lim_{x→∞} f(x) + lim_{x→∞} g(x).

Proof. Say lim_{x→∞} f(x) = L and lim_{x→∞} g(x) = M. We must prove lim_{x→∞} (f(x) + g(x)) = L + M.
Take ε > 0. We need to find an N for which x > N implies |(f(x) + g(x)) − (L + M)| < ε.
Because lim_{x→∞} f(x) = L, there is an N′ > 0 such that x > N′ implies |f(x) − L| < ε/2.
Because lim_{x→∞} g(x) = M, there is an N″ > 0 such that x > N″ implies |g(x) − M| < ε/2.
Put N = max{N′, N″}. If x > N, then
|(f(x) + g(x)) − (L + M)| = |(f(x) − L) + (g(x) − M)| ≤ |f(x) − L| + |g(x) − M| < ε/2 + ε/2 = ε.
We've now shown that for any ε > 0, there is an N > 0 for which x > N implies
|(f(x) + g(x)) − (L + M)| < ε. Thus lim_{x→∞} (f(x) + g(x)) = L + M. ∎
7. If both lim_{x→∞} f(x) and lim_{x→∞} g(x) exist, then lim_{x→∞} (f(x) − g(x)) = lim_{x→∞} f(x) − lim_{x→∞} g(x).

Proof. Suppose both lim_{x→∞} f(x) and lim_{x→∞} g(x) exist. Using exercises 4 and 5 above,

lim_{x→∞} (f(x) − g(x)) = lim_{x→∞} (f(x) + (−1)·g(x)) = lim_{x→∞} f(x) + lim_{x→∞} (−1)·g(x)
  = lim_{x→∞} f(x) + (−1)·lim_{x→∞} g(x) = lim_{x→∞} f(x) − lim_{x→∞} g(x). ∎




9. If lim_{x→∞} g(x) = L and f is continuous at x = L, then lim_{x→∞} f(g(x)) = f(lim_{x→∞} g(x)).

Proof. Suppose lim_{x→∞} g(x) = L and f is continuous at x = L. We need to prove
lim_{x→∞} f(g(x)) = f(L). Definition 13.4 says we must prove that for any ε > 0, there is
a corresponding N > 0 for which x > N implies |f(g(x)) − f(L)| < ε.

So let ε > 0. As f is continuous at L, Definition 13.3 yields lim_{x→L} f(x) = f(L). From
this, we know there is a real number δ > 0 for which

|x − L| < δ implies |f(x) − f(L)| < ε. (∗)

But also, from lim_{x→∞} g(x) = L, we know that there is a real number N > 0 for which
x > N implies |g(x) − L| < δ. If x > N, then we have |g(x) − L| < δ, and from this (∗)
yields |f(g(x)) − f(L)| < ε. Thus lim_{x→∞} f(g(x)) = f(L), and the proof is complete. ∎


Section 13.7 


1. Prove that {2ⁿ/n!} converges to 0.

Proof. Observe that 0 < 2ⁿ/n! ≤ 4/n for any n ∈ ℕ because

2ⁿ/n! = (2·2·2···2·2·2)/(n(n−1)(n−2)···3·2·1) = (2/n)·(2/(n−1))·(2/(n−2))···(2/3)·(2/2)·(2/1)
      ≤ (2/n)·1·1···1·1·2 = 4/n.

Thus |2ⁿ/n! − 0| ≤ 4/n for any n ∈ ℕ. Given ε > 0, choose an integer N > 4/ε. If n > N, then
|2ⁿ/n! − 0| = |2ⁿ/n!| ≤ 4/n < 4/N < 4/(4/ε) = ε. By Definition 13.5, {2ⁿ/n!} converges to 0. ∎
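The bound 2ⁿ/n! ≤ 4/n and the resulting choice of N can be checked with exact
rational arithmetic. This is a hedged sketch: term and N_for are hypothetical
names, and N_for picks the smallest integer exceeding 4/ε, which is one valid
choice among many.

```python
from fractions import Fraction
from math import factorial, floor


def term(n):
    """The n-th term 2^n / n! as an exact fraction."""
    return Fraction(2 ** n, factorial(n))


def N_for(eps):
    """An integer N with N > 4/eps, matching the choice made in the proof."""
    return floor(4 / Fraction(eps)) + 1


if __name__ == "__main__":
    eps = Fraction(1, 100)
    N = N_for(eps)
    print(N, term(N + 1) < eps)
```

Exact fractions avoid any doubt about floating-point rounding when the terms
become extremely small.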


3. Prove that {(2n² + 1)/(3n − 1)} diverges to ∞.

Proof. Note that (2n² + 1)/(3n − 1) > 2n²/(3n) = 2n/3. For any L > 0, let N = 3L/2. Then for n > N
we have (2n² + 1)/(3n − 1) > 2n/3 > 2N/3 = L. By Definition 13.6, the sequence diverges to ∞. ∎


5. Prove that {(2n + 1)/(3n − 1)} converges to 2/3.

Proof. For n ≥ 1 we have |(2n + 1)/(3n − 1) − 2/3| = |3(2n + 1)/(3(3n − 1)) − 2(3n − 1)/(3(3n − 1))| = |5/(9n − 3)| = 5/(9n − 3).
Given ε > 0, we will have 5/(9n − 3) < ε provided that 9n − 3 > 5/ε, or n > 5/(9ε) + 1/3.

Therefore, given any ε > 0, take an integer N > 5/(9ε) + 1/3. If n > N, then
|(2n + 1)/(3n − 1) − 2/3| = 5/(9n − 3) < 5/(9N − 3) < 5/(9(5/(9ε) + 1/3) − 3) = ε.
By Definition 13.5, {(2n + 1)/(3n − 1)} converges to 2/3. ∎

7. Prove that if a sequence diverges to infinity, then it diverges.

Proof. For the sake of contradiction, suppose that {aₙ} diverges to ∞, and {aₙ}
converges to a number L. Definition 13.5 says that for ε = 1 there is a number
N > 0 for which n > N implies |aₙ − L| < 1. Also, Definition 13.6 guarantees an
N′ > 0 for which n > N′ implies aₙ > L + 1, that is, aₙ − L > 1.

Let n be larger than both N and N′. Then |aₙ − L| < 1 and aₙ − L > 1. Thus
|aₙ − L| < 1 and |aₙ − L| > 1, a contradiction. ∎


9. Prove that if {aₙ} converges to L, and c ∈ ℝ, then {caₙ} converges to cL.

Proof. Suppose {aₙ} converges to L, and c ∈ ℝ. If c = 0, then {caₙ} is the sequence
0, 0, 0, 0, ..., and this converges to 0 = cL. Thus the theorem is true if c = 0, so for
the remainder of the proof we treat the case c ≠ 0.

Let ε > 0. Because {aₙ} converges to L, there exists an N > 0 for which n > N
implies |aₙ − L| < ε/|c|. So if n > N, then |caₙ − cL| = |c(aₙ − L)| = |c|·|aₙ − L| < |c|·ε/|c| = ε.
In summary, we've shown that for any ε > 0, there is an N > 0 for which n > N
implies |caₙ − cL| < ε. By Definition 13.5, {caₙ} converges to cL. ∎


11. Prove that if {aₙ} converges to L and {bₙ} converges to M, then the sequence
{aₙbₙ} converges to LM.

Proof. Suppose {aₙ} converges to L and {bₙ} converges to M. We must prove
{aₙbₙ} converges to LM. To prove this, take ε > 0. We need to find an N for which
n > N implies |aₙbₙ − LM| < ε. With this in mind, notice that

|aₙbₙ − LM| = |(aₙbₙ − Lbₙ) + (Lbₙ − LM)|
            ≤ |aₙbₙ − Lbₙ| + |Lbₙ − LM|
            = |(aₙ − L)bₙ| + |L(bₙ − M)|
            = |aₙ − L|·|bₙ| + |L|·|bₙ − M|. (∗)

Take N′ > 0 large enough so that n > N′ implies |bₙ − M| < 1. If n > N′, then
|bₙ| = |(bₙ − M) + M| ≤ |bₙ − M| + |M| < 1 + |M|.
Replacing |bₙ| in (∗) with the larger quantity 1 + |M|, we get

|aₙbₙ − LM| ≤ |aₙ − L|·(1 + |M|) + |L|·|bₙ − M| (∗∗)

for all n > N′. Now take N″ > 0 such that n > N″ implies |aₙ − L| < ε/(2(1 + |M|)). Take
N‴ > 0 such that n > N‴ implies |bₙ − M| < ε/(2|L|). Put N = max{N′, N″, N‴}. If
n > N, then (∗∗) becomes

|aₙbₙ − LM| < ε/(2(1 + |M|))·(1 + |M|) + |L|·ε/(2|L|) = ε/2 + ε/2 = ε.

To summarize, we've shown that for any ε > 0, there is an N > 0 for which n > N
implies |aₙbₙ − LM| < ε. Therefore {aₙbₙ} converges to LM. ∎


13. Prove that if {|aₙ|} converges to 0, then {aₙ} converges to 0. Give an example of
a sequence {aₙ} for which {|aₙ|} converges to a number L ≠ 0, but {aₙ} diverges.

Proof. Suppose {|aₙ|} converges to 0. This means that for any ε > 0, there is an
N > 0 for which n > N implies ||aₙ| − 0| < ε. But ||aₙ| − 0| = |aₙ − 0|. Thus n > N
implies |aₙ − 0| < ε. Therefore {aₙ} converges to 0.

Consider the sequence {(−1)ⁿ}, which is −1, 1, −1, 1, −1, .... This sequence diverges.
But {|(−1)ⁿ|} is the sequence 1, 1, 1, 1, ..., which converges to 1. ∎


Chapter 14 Exercises 
Section 14.1 


1. ℝ and (0,∞)

Observe that the function f(x) = e^x sends ℝ to (0,∞). It is injective because
f(x) = f(y) implies e^x = e^y, and taking ln of both sides gives x = y. It is surjective
because if b ∈ (0,∞), then f(ln(b)) = b. Therefore, because of the bijection f : ℝ →
(0,∞), it follows that |ℝ| = |(0,∞)|.


3. ℝ and (0,1)

Observe that the function f(x) = (1/π)·cot⁻¹(x) sends ℝ to (0,1). It is injective and
surjective by elementary trigonometry. Therefore, because of the bijection f :
ℝ → (0,1), it follows that |ℝ| = |(0,1)|.


5. A = {3k : k ∈ ℤ} and B = {7k : k ∈ ℤ}

Observe that the function f(x) = (7/3)x sends A to B. It is injective because f(x) = f(y)
implies (7/3)x = (7/3)y, and multiplying both sides by 3/7 gives x = y. It is surjective
because if b ∈ B, then b = 7k for some integer k. Then 3k ∈ A, and f(3k) = 7k = b.
Therefore, because of the bijection f : A → B, it follows that |A| = |B|.


7. ℤ and S = {..., 1/8, 1/4, 1/2, 1, 2, 4, 8, 16, ...}

Observe that the function f : ℤ → S defined as f(n) = 2ⁿ is bijective: It is injective
because f(m) = f(n) implies 2ᵐ = 2ⁿ, and taking log₂ of both sides produces m = n.
It is surjective because any element b of S has form b = 2ⁿ for some integer n,
and therefore f(n) = 2ⁿ = b. Because of the bijection f : ℤ → S, it follows that
|ℤ| = |S|.


9. {0,1} × ℕ and ℕ

Consider the function f : {0,1} × ℕ → ℕ defined as f(a,n) = 2n − a. This is injective
because if f(a,n) = f(b,m), then 2n − a = 2m − b. Now if a were unequal to b, one of
a or b would be 0 and the other would be 1, and one side of 2n − a = 2m − b would
be odd and the other even, a contradiction. Therefore a = b. Then 2n − a = 2m − b
becomes 2n − a = 2m − a; add a to both sides and divide by 2 to get m = n. Thus we
have a = b and m = n, so (a,n) = (b,m), so f is injective. To see that f is surjective,
take any b ∈ ℕ. If b is even, then b = 2n for some integer n, and f(0,n) = 2n − 0 = b.
If b is odd, then b = 2n + 1 for some integer n. Then f(1,n+1) = 2(n+1) − 1 = 2n + 1 = b.
Therefore f is surjective. Then f is a bijection, so |{0,1} × ℕ| = |ℕ|.
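The map f(a,n) = 2n − a and the inverse implicit in the surjectivity argument can
be written out and tested on an initial segment of ℕ. The sketch below is
illustrative only; f_inv is a hypothetical name for the inverse read off from the
even and odd cases.

```python
def f(a, n):
    """f : {0,1} x N -> N, f(a, n) = 2n - a."""
    return 2 * n - a


def f_inv(b):
    """Inverse read off from the proof: even b comes from a = 0, odd b from a = 1."""
    if b % 2 == 0:
        return (0, b // 2)
    return (1, (b + 1) // 2)


if __name__ == "__main__":
    print([f(a, n) for n in range(1, 5) for a in (1, 0)])
```

Restricted to n ≤ K, the map hits each of 1, 2, ..., 2K exactly once, which is
the finite shadow of the bijection.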


11. [0,1] and (0,1)

Proof. Consider the subset X = {1/n : n ∈ ℕ} ⊆ [0,1]. Let f : [0,1] → [0,1) be defined
as f(x) = x if x ∈ [0,1] − X and f(1/n) = 1/(n+1) for any 1/n ∈ X. It is easy to check that
f is a bijection. Next let Y = {1 − 1/n : n ∈ ℕ} ⊆ [0,1), and define g : [0,1) → (0,1) as
g(x) = x if x ∈ [0,1) − Y and g(1 − 1/n) = 1 − 1/(n+1) for any 1 − 1/n ∈ Y. As in the case of f, it
is easy to check that g is a bijection. Therefore the composition g∘f : [0,1] → (0,1)
is a bijection. (See Theorem 12.2.) We conclude that |[0,1]| = |(0,1)|. ∎


13. 𝒫(ℕ) and 𝒫(ℤ)

Outline: By Exercise 18 of Section 12.2, we have a bijection f : ℕ → ℤ defined as
f(n) = ((−1)ⁿ(2n − 1) + 1)/4. Now define a function Φ : 𝒫(ℕ) → 𝒫(ℤ) as Φ(X) = {f(x) :
x ∈ X}. Check that Φ is a bijection.
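The outline can be explored numerically. The sketch below assumes the bijection
from ℕ to ℤ is f(n) = ((−1)ⁿ(2n − 1) + 1)/4 (this formula is an assumption here,
reconstructed rather than quoted) and lifts it to sets elementwise. The name phi
stands in for the map on power sets and is hypothetical.

```python
from itertools import combinations


def f(n):
    """Assumed bijection N -> Z: 1, 2, 3, 4, 5, ... -> 0, 1, -1, 2, -2, ..."""
    numerator = (-1) ** n * (2 * n - 1) + 1
    return numerator // 4  # exact: the numerator is always a multiple of 4


def phi(X):
    """Elementwise lift of f to sets: phi(X) = {f(x) : x in X}."""
    return frozenset(f(x) for x in X)


if __name__ == "__main__":
    print([f(n) for n in range(1, 8)])
    print(sorted(phi({1, 2, 3})))
```

Because f is injective, distinct sets have distinct images under phi, and because
f is surjective, every set of integers is the image of its preimage under f.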

15. Find a formula for the bijection f in Example 14.2.

Hint: Consider the function f from Exercise 18 of Section 12.2.


Section 14.2 


1. Prove that the set A = {ln(n) : n ∈ ℕ} ⊆ ℝ is countably infinite.
Note that its elements can be written in infinite list form as ln(1), ln(2), ln(3), ···.
Thus A is countably infinite.


3. Prove that the set A = {(5n, −3n) : n ∈ ℤ} is countably infinite.

Consider the function f : ℤ → A defined as f(n) = (5n, −3n). This is clearly surjective,
and it is injective because f(n) = f(m) gives (5n, −3n) = (5m, −3m), so 5n = 5m,
hence m = n. Thus, because f is bijective, |ℤ| = |A|, and |A| = |ℤ| = ℵ₀. Therefore
A is countably infinite.


5. Prove or disprove: There exists a countably infinite subset of the set of irrational
numbers.
This is true. Just consider the set consisting of the irrational numbers π/1, π/2, π/3, π/4, ···.


7. Prove or disprove: The set ℚ¹⁰⁰ is countably infinite.

This is true. Note ℚ¹⁰⁰ = ℚ × ℚ × ··· × ℚ (100 times), and since ℚ is countably infinite,
it follows from the corollary of Theorem 14.5 that this product is countably
infinite.


9. Prove or disprove: The set {0,1} × ℕ is countably infinite.

This is true. Note that {0,1} × ℕ can be written in infinite list form as
(0,1), (1,1), (0,2), (1,2), (0,3), (1,3), (0,4), (1,4), ···. Thus the set is countably infinite.


11. Partition ℕ into 8 countably infinite sets.

For each i ∈ {1,2,3,4,5,6,7,8}, let Xᵢ be those natural numbers that are congruent
to i modulo 8, that is,

X₁ = {1, 9, 17, 25, 33, ...}
X₂ = {2, 10, 18, 26, 34, ...}
X₃ = {3, 11, 19, 27, 35, ...}
X₄ = {4, 12, 20, 28, 36, ...}
X₅ = {5, 13, 21, 29, 37, ...}
X₆ = {6, 14, 22, 30, 38, ...}
X₇ = {7, 15, 23, 31, 39, ...}
X₈ = {8, 16, 24, 32, 40, ...}


13. If A = {X ⊆ ℕ : X is finite}, then |A| = ℵ₀.

Proof. This is true. To show this we will describe how to arrange the items of
A in an infinite list X₁, X₂, X₃, X₄, ....

For each natural number n, let pₙ be the nth prime number. Thus p₁ = 2,
p₂ = 3, p₃ = 5, p₄ = 7, p₅ = 11, and so on. Now consider any element X ∈ A. If
X ≠ ∅, then X = {n₁, n₂, n₃, ..., nₖ}, where k = |X| and nᵢ ∈ ℕ for each 1 ≤ i ≤ k.
Define a function f : A → ℕ ∪ {0} as follows: f({n₁, n₂, n₃, ..., nₖ}) = p_{n₁}·p_{n₂}···p_{nₖ}.
For example, f({1,2,3}) = p₁p₂p₃ = 2·3·5 = 30, and f({3,5}) = p₃p₅ = 5·11 = 55, etc.
Also, we should not forget that ∅ ∈ A, and we define f(∅) = 0.

Note f : A → ℕ ∪ {0} is injective: Let X = {n₁, n₂, n₃, ..., nₖ} and Y = {m₁, m₂, m₃, ..., m_ℓ},
and X ≠ Y. Then there is an integer a that belongs to one of X or Y but not the
other. Then the prime factorization of one of the numbers f(X) and f(Y) uses
the prime number p_a but the prime factorization of the other does not use p_a. It
follows that f(X) ≠ f(Y) by the fundamental theorem of arithmetic. Thus f is
injective.

So each set X ∈ A is associated with an integer f(X) ≥ 0, and no two different sets
are associated with the same number. Thus we can list the elements X ∈ A in
increasing order of the numbers f(X). The list begins as

∅, {1}, {2}, {3}, {1,2}, {4}, {1,3}, {5}, {6}, {1,4}, {2,3}, {7}, ...

It follows that A is countably infinite. ∎
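The prime-product encoding of finite sets, and the start of the resulting list,
can be reproduced in a few lines. This sketch is an illustration under stated
assumptions: the helper names first_primes, encode and listing are hypothetical,
and listing only considers subsets of {1, ..., max_element}, so its output is
trustworthy only for codes below the next prime after p_{max_element}.

```python
from itertools import combinations
from math import prod


def first_primes(count):
    """Return the first `count` primes by trial division."""
    primes = []
    candidate = 2
    while len(primes) < count:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def encode(subset, primes):
    """f(X): 0 for the empty set, else the product of p_n over n in X."""
    if not subset:
        return 0
    return prod(primes[n - 1] for n in subset)


def listing(max_element, length):
    """First `length` subsets of {1..max_element}, ordered by their codes."""
    primes = first_primes(max_element)
    universe = range(1, max_element + 1)
    subsets = [frozenset(c) for k in range(max_element + 1)
               for c in combinations(universe, k)]
    subsets.sort(key=lambda s: encode(s, primes))
    return [sorted(s) for s in subsets[:length]]


if __name__ == "__main__":
    print(listing(7, 12))
```

With max_element = 7 the largest prime used is 17, and every set containing an
element above 7 has code at least 19, so the first twelve entries are exact.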


15. Hint: Use the fundamental theorem of arithmetic. 


Section 14.3 


1. Suppose B is an uncountable set and A is a set. Given that there is a surjective
function f : A → B, what can be said about the cardinality of A?
The set A must be uncountable, as follows. For each b ∈ B, let a_b be an element
of A for which f(a_b) = b. (Such an element must exist because f is surjective.)
Now form the set U = {a_b : b ∈ B}. Then the function f : U → B is bijective, by
construction. Then since B is uncountable, so is U. Therefore U is an uncountable
subset of A, so A is uncountable by Theorem 14.9.

3. Prove or disprove: If A is uncountable, then |A| = |ℝ|.
This is false. Let A = 𝒫(ℝ). Then A is uncountable, and by Theorem 14.7,
|ℝ| < |𝒫(ℝ)| = |A|.

5. Prove or disprove: The set {0,1} × ℝ is uncountable.

This is true. To see why, first note that the function f : ℝ → {0} × ℝ defined as
f(x) = (0,x) is a bijection. Thus |ℝ| = |{0} × ℝ|, and since ℝ is uncountable, so is
{0} × ℝ. Then {0} × ℝ is an uncountable subset of the set {0,1} × ℝ, so {0,1} × ℝ is
uncountable by Theorem 14.9.


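7. Prove or disprove: If A ⊆ B and A is countably infinite and B is uncountable,
then B − A is uncountable.

This is true. To see why, suppose to the contrary that B − A is countable. If B − A
is finite, then B = A ∪ (B − A) can be listed by writing the finitely many elements
of B − A first and then the elements of A, so B is countable. Otherwise B − A is
countably infinite, so B = A ∪ (B − A) is a union of countably infinite sets, and thus
countable, by Theorem 14.6. Either way this contradicts the fact that B is uncountable.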


Section 14.4 
1. Show that if A ⊆ B and there is an injection g : B → A, then |A| = |B|.

Just note that the map f : A → B defined as f(x) = x is an injection. Now apply
the Cantor-Bernstein-Schröder theorem.


3. Let ℱ be the set of all functions ℕ → {0,1}. Show that |ℝ| = |ℱ|.

Because |ℝ| = |𝒫(ℕ)|, it suffices to show that |ℱ| = |𝒫(ℕ)|. To do this, we will
exhibit a bijection f : ℱ → 𝒫(ℕ). Define f as follows. Given a function φ ∈ ℱ,
let f(φ) = {n ∈ ℕ : φ(n) = 1}. To see that f is injective, suppose f(φ) = f(θ). Then
{n ∈ ℕ : φ(n) = 1} = {n ∈ ℕ : θ(n) = 1}. Put X = {n ∈ ℕ : φ(n) = 1}. Now we see that if
n ∈ X, then φ(n) = 1 = θ(n). And if n ∈ ℕ − X, then φ(n) = 0 = θ(n). Consequently
φ(n) = θ(n) for any n ∈ ℕ, so φ = θ. Thus f is injective. To see that f is surjective,
take any X ∈ 𝒫(ℕ). Consider the function φ ∈ ℱ for which φ(n) = 1 if n ∈ X and
φ(n) = 0 if n ∉ X. Then f(φ) = X, so f is surjective.


5. Consider the subset B = {(x, y) : x² + y² ≤ 1} ⊆ ℝ². Show that |B| = |ℝ²|.
This will follow from the Cantor-Bernstein-Schröder theorem provided that we
can find injections f : B → ℝ² and g : ℝ² → B. The function f : B → ℝ² defined as
f(x, y) = (x, y) is clearly injective. For g : ℝ² → B, consider the function

g(x, y) = ( x/√(x² + y² + 1), y/√(x² + y² + 1) ).

Verify that this is an injective function g : ℝ² → B.
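The verification asked for here has two parts: g must land inside the unit disk,
and it must be injective. Both can be explored numerically with the sketch
below, which exhibits a left inverse on the open disk. The name g_inv is
hypothetical, and floating-point checks are evidence rather than proof.

```python
from math import isclose, sqrt


def g(x, y):
    """g(x, y) = (x, y) / sqrt(x^2 + y^2 + 1)."""
    r = sqrt(x * x + y * y + 1)
    return (x / r, y / r)


def g_inv(u, v):
    """Left inverse on the open unit disk: (u, v) / sqrt(1 - u^2 - v^2)."""
    s = sqrt(1 - u * u - v * v)
    return (u / s, v / s)


if __name__ == "__main__":
    u, v = g(1.0, 2.0)
    print(u * u + v * v < 1, g_inv(u, v))
```

The algebra behind g_inv: if (u, v) = g(x, y) then u² + v² = (x² + y²)/(x² + y² + 1),
so 1 − u² − v² = 1/(x² + y² + 1), and dividing by its square root undoes the scaling.
A map with a left inverse is injective.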
7. Prove or disprove: If there is an injection f : A → B and a surjection g : A → B, then
there is a bijection h : A → B.

This is true. Here is an outline of a proof. Define a function g′ : B → A as follows.
For each b ∈ B, choose an element x_b ∈ g⁻¹({b}). (That is, choose an element x_b ∈ A
for which g(x_b) = b.) Now let g′ : B → A be the function defined as g′(b) = x_b. Check
that g′ is injective and apply the Cantor-Bernstein-Schröder theorem.




Index 


absolute convergence test, 268 
absolute value, 6, 245 
addition principle, 74 

and, 39 

axiom of foundation, 32 


basis step, 182 
biconditional statement, 46 
bijection, 270 

bijective function, 228 
byte, 66 


C(n,k), 85 

Cantor, Georg, 271 
Cantor-Bernstein-Schröder theorem, 286
cardinality, 4, 269 

Cartesian plane, 10 

Cartesian power, 10 

Cartesian product, 8 

ceiling of a number, 104 

closed interval, 7 

codomain of a function, 225 
Cohen, Paul, 289 
combinatorial proof, 108 
comparison test, 268 
complement of a set, 20 
composite number, 116 
composition of functions, 235 
conditional statement, 42 
conjecture, 173 

constructive proof, 154 
continuity, 256 

continuous function, 256 
continuum hypothesis, 289 
contrapositive, 128 
convergence of a sequence, 262 
convergence of a series, 266 
converse of a statement, 46, 128 


corollary, 114 
countable set, 275 
counterexample, 175 
counting, 65 


definition, 113 
DeMorgan’s laws, 51, 59 
difference of sets, 18 
differentiability, 257 
disproof, 172 

divergence of a sequence, 262 
divergence of a series, 266 
divergence test, 267 
divergence to ∞, 264
divides, 116 

division algorithm, 30, 117 
division principle, 104 
divisor, 116 

domain of a function, 225 
Doxiadis, Apostolos, 33 


element of a set, 3 
elimination, 63 

empty set, 4 

entries of a list, 65 
equality of functions, 227 
equality of lists, 66 
equality of sets, 3 
equivalence class, 211 
equivalence relation, 210 
equivalent statements, 149 
Euclid, 140, 166 

Euler, Leonhard, 133, 168 
existence theorem, 151 
existential quantifier, 54 
existential statement, 151 


factorial, 78 




false, 35 
Fermat’s last theorem, 37 
Fermat, Pierre de, 37 
Fibonacci sequence, 193 
floor of a number, 104 
function, 224 
range of, 225 
bijective, 228, 269 
codomain of, 225 
composition of, 235 
continuous, 256 
derivative of, 257 
differentiable, 257 
domain of, 225 
equality, 227 
injective, 228 
inverse, 238 
notation, 227 
one-to-one, 228 
onto, 228 
surjective, 228 
function notation, 227 
fundamental theorem of arithmetic, 192 
fundamental theorem of calculus, viii 


gamma function, 84 
gcd, 116 
general term, 261 
geometric sequence, 195 
geometric series, 268 
Goldbach’s conjecture, 37, 58 
Goldbach, Christian, 37 
golden ratio, 195 
graph, 189 

cycle, 189 

edges, 189 

vertices, 189 
greatest common divisor, 116 


Hagy, Jessica, 33 
half-open interval, 7 
harmonic series, 268 


if-and-only-if theorem, 147 
image, 242 

inclusion-exclusion formula, 93 
index set, 26 

indexed set, 25 

induction, 180 




strong, 187 
inductive hypothesis, 182 
inductive step, 182 
infinite interval, 7 
injection, 270 
injective function, 228 
integers, 3, 4 

congruence, 131, 207 

modulo n, 219 
intersection of sets, 18 
interval, 6 
inverse of a function, 238 
inverse relation, 239 
irrational number, 139 


lcm, 116

least common multiple, 116 

lemma, 114 

length of a list, 65 

limit, 247 
at infinity, 259 
composition rule for, 257 
constant function rule for, 251 
constant multiple rule for, 252 
difference rule for, 252 
division rule for, 253 
identity function rule for, 251 
informal definition of, 246 
multiplication rule for, 253 
non-existence of, 249 
precise definition of, 247 
squeeze theorem for, 255 
sum rule for, 252 

limit comparison test, 268 

list, 65 
empty, 66 
entries, 65 
equal, 66 
length, 65 
non-repetitive, 69 
order, 65 
repetition, 69 

logic, 34 
contradiction, 137 
equivalence, 51 
inference, 63 
quantifier, 54 

existential, 54 
universal, 54 


symbols, 48, 53 
logical equivalence, 51 
logical inference, 63 


mean value theorem, 57 
Mersenne prime, 170 
modus ponens, 63 

modus tollens, 63 

multiple, 116 
multiplication principle, 69 
multiplicity, 96 

multiset, 96 


natural numbers, 4 
necessary condition, 44 
negation of a statement, 41 
non-constructive proof, 154 


one-to-one function, 228 
onto function, 228 

open interval, 7 

open sentence, 36, 56 
or, 40 

ordered pair, 8 

ordered triple, 10 


P(n,k), 81, 83 

Papadimitriou, Christos, 33 

parity, 115 

partial sum of a series, 265 

partition, 216 

Pascal’s triangle, 91 

Pascal, Blaise, 91 

perfect number, 165, 168 

permutation, 80 
k-permutation, 81, 83 


pigeonhole principle, 104, 233 


strong form, 104 
Pisano, Leonardo, 193 
power set, 15 
power, Cartesian, 10 
preimage, 242 
prime number, 37, 116 
proof
by cases, 124
by contradiction, 137
by induction, 180
by smallest counterexample, 191
by strong induction, 187


combinatorial, 108 
constructive, 154 
contrapositive, 128 
direct, 118
existence, 150 
involving sets, 157 
non-constructive, 154 
uniqueness, 150, 153 
within-a-proof, 143 
proposition, 114, 118 
Pythagorean theorem, 37 


quadratic formula, 37 
quantifier, 54 
quotient, 30, 117 


range of a function, 225 
ratio test, 268 

rational numbers, 6, 139 
real numbers, 4 


reflexive property of a relation, 205 


relations, 201 
between sets, 221 
equivalence, 210 

class, 211 
inverse, 239 
reflexive, 205 
symmetric, 205 
transitive, 205 

remainder, 30, 117, 131 

Russell’s paradox, 32 

Russell, Bertrand, 32 


sequence, 261 
convergence of, 262 
divergence of, 262 
divergence to ∞, 264
general term, 261 

series, 265 




absolute convergence test for, 268 


comparison test for, 268 
convergence of, 266 
divergence of, 266 
divergence test for, 267 
geometric, 268 
harmonic, 268 


limit comparison test for, 268 


Maclaurin, 265 
partial sum of, 265 




ratio test for, 268 
set(s) 
builder-notation, 5, 157 
cardinalities of 
comparison of, 280 
equal, 269 
unequal, 269 
cardinality of, 3, 269 
complement, 20 
countable, 275 
element of, 3 
empty, 4 
equal, 3 
partition of, 216 
subset of, 12 
uncountable, 275 
sigma notation, 25 
size, see cardinality 
statement, 35 
biconditional, 46 
conditional, 42 
necessary, 44 
sufficient, 44 
converse, 46 
equivalent, 149 
existential, 151 
negation, 59 
string, 66 
strong form of pigeonhole principle, 104 
strong induction, 187 
subset, 12 
subtraction principle, 76 
sufficient condition, 44 
surjection, 270 
surjective function, 228 
symmetric property of a relation, 205 


theorem, 113 
existence, 151 
if-and-only-if, 147 
three-dimensional space, 10 
transitive property of a relation, 205 
tree, 189 
triangle inequality, 245 
triple, ordered, 10 
true, 35 
truth 
table, 39 
value, 39 




uncountable set, 275 
union of sets, 18 
uniqueness proof, 153 
unit circle, 14, 20 
universal quantifier, 54 
universal set, 20 


variable, 36 
vector space, 157 
Venn diagram, 22 


well-ordering principle, 30 
Wiles, Andrew, 37 
WLOG, 125 


Zermelo-Fraenkel axioms, 32 


